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Smarter technology for a Smarter Planet: 


It's time to ask smarter questions. 


What exactly does a benchmark mean? For the last five years, IBM DB2® on Power Systems” has ranked 

first on three of the industry's leading performance benchmarks, longer than Oracle and Microsoft combined! 
But shouldn't we be asking our IT to deliver more than just raw performance? What really matters isn't some 
abstract measure of performance, it’s what companies actually do with it. For instance, Coca-Cola Bottling 
Company is using DB2 on Power to reduce licensing, maintenance and storage fees by $350,000. EuResist 
is using an integrated analytics solution to predict the most effective drug combinations for individuals 
with HIV, with 78% accuracy. And the Dubai Gold & Commodities Exchange is working with IBM Security 
Services to achieve system uptime of over 99.9%. On a smarter planet, these are the benchmarks that matter. 


A smarter business is built on smarter software, systems and services. 
Let's build a smarter planet. ibm.com/questions 
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፪ 
ጆ 
8 
-| 
5 
Di 
ê 
8 
፤ 
8 
3 
: 
E 
5 
Š 
3 
5 
ê 
| 


a ase 


| e | | 

rroring 
Server 2008 R | 

SQL Server 2008 ` 


/ | —Richard Waymiré 


| \ 
Database mirroring can be a great way to improve 


databáse availability because it's easy to ላ 


administer, and maintain 


FEATURES 
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Number of Rows and Logical መሪያ Build a Cube to Perform 
Ordering Scenario-Based Risk 
—Itzik Ben-Gan Analysis 
Use the TOP, FETCH FIRST, and TOP OVER filters to —Kristl Smith Tyler 
improve your T-SQL queries. Find out how to design and build a cube that can 
ù be used to simulate changing conditions for risk 
ፊያ SQL Server Virtualization analysis; 


and Availability 

—Michael Otey 

Learn how you can use Hyper-V (or other hypervisors) 
to increase your SQL Server system's reliability and use 
resources more efficiently. 


Editor’s Tip 
) E Using Performance Profiler Attending SQL Connections in 
to Troubleshoot ADO.NET Las Vegas November 1—4? Be 
Applications sure to stop by the SQL Mag 
—William Sheldon — booth---We'd love to chat with you! 
If you're a developer, you might find many perfor- —Sheila Molnar, executive editor 
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edge reports 
with the complete 
set of tools from 


Altova? 


Experience how the Altova MissionKit®, the 
integrated suite of XML, database, and data 


integration tools, lets you display and analyze data 
through enhanced chart and report generation. 


Report generation is finally easy — and affordable - 
with Altova MissionKit reporting tools: 


StyleVision® — stylesheet and report design tool 


* Consolidated reporting based on XML, 
database and/or XBRL data 
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* Dynamic data selection and rendering 
e HTML, Microsoft Word, PDF, and e-Form report creation 
በ XMLSpy® — advanced XML editor 

e Instant creation of charts to analyze XML data 

* One-click export of charts to XSLT, XQuery, or image files 
MapForce® — any-to-any data mapping tool 

e Integration with StyleVision for rendering attractive reports 
* Report generation for virtually any data format: 
XML, databases, EDI, flat files, Excel 2007+, and more 
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NoSQL? No Way! 


ne trend gaining traction in web development 
Oli. is the NoSQL movement. NoSQL 
recently received quite a positive bump when Twitter, 
the well-known social media website, announced it 
was moving from MySQL to the Java-based NoSQL 
Cassandra database. Twitter isn’t the only notable 
website using a NoSQL implementation—other sites 
include Facebook, Digg, and Rackspace. Despite all 
the buzz, many SQL Server professionals I’ve run 
across don’t really know what NoSQL is. Some think 
it’s a product, but most think it’s a competitor and 
potential threat to SQL Server and relational data- 
bases in general. They think it’s just the kind of non- 
sense that well-meaning but misguided application 
developers would come up with. In fact, the reality of 
what NoSQL is all about is quite different. 

In this column 1 ሽ dig into NoSQL and explain 
how it compares to relational databases such as SQL 
Server. Save this column for that elevator ride with 
management: When an executive says he or she has 
heard that NoSQL is the wave of the future, you'll 
be able to toss off a few key points explaining why 
NoSQL isn’t really a viable replacement for SQL 
Server. 


Not Only SQL 

First, some history: The term “NoSQL” was coined 
back in 1998, and it originally stood for “Not Only 
SQL.” So the name NoSQL is a bit of a misnomer. 
NoSQL began its life as an alternative form of struc- 
tured data storage. NoSQL databases are defined by 
more than the lack of a SQL query interface. In fact, 
NoSQL databases aren’t relational at all. Today’s 
NoSQL databases are distributed data stores that are 
designed for very large-scale data access requirements. 
Probably the most popular NoSQL database is the 
Cassandra open-source project. Cassandra began 
as Facebook’s proprietary database, but Facebook 
released it as open source in 2008. To learn more 
about NoSQL databases and the open-source Cassan- 
dra implementation, check out cassandra.apache.org. 
Other NoSQL implementations include Google Big- 
Table, Apache Hadoop, MapReduce, MemcacheDB, 
and Voldemort. 


NoSQL, Not Relational 

Handling huge amounts of data storage and data 
writes, such as Twitter’s 7TB per day, can be problem- 
atic for relational databases. NoSQL distributed data 
store implementations are able to scale horizontally 
by distributing data across multiple nodes. Greater 
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degrees of scaling are achieved by adding more nodes. 
To learn more about the rationale behind Twitter’s 
move to NoSQL, take a look at the slide show at 
www.slideshare.net/kevinweil/nosql-at-twitter-nosql- 
eu-2010. 


NoSQL, No ACID 

It’s important to remember that even though NoSQL 
databases are designed for performance and scal- 
ability, they aren't designed to support the atomic- 
ity, consistency, isolation, and durability (ACID) 
properties that are standard in relational databases 
such as SQL Server. This is the critical tradeoff 
for NoSQL databases. They eschew the overhead 
that it takes to guarantee data integrity in favor of 
the ability to scale and handle massive amounts of 
data. Data integrity is the foundation for almost all 
business-oriented relational database applications. In 
addition, relational databases are designed to support 
queries and reporting; these aren't typically important 
requirements for NoSQL implementations. Relational 
database developers have come to take for granted the 
standardized SQL data access language that works 
with most enterprise relational database platforms. 
Although there are a few differences among the 
vendors of SQL implementations, there are far more 
SQL similarities. Not so in the NoSQL realm; each 
different NoSQL implementation uses a proprietary 
data access mechanism. 


NoSQL, No Fear 
SQL Server DBAs don’t need to fear that NoSQL 
represents some new technology wave that will 
sweep away relational databases such as SQL Server. 
Instead NoSQL is just another tool in the application 
developer’s toolbox. Just as you can’t use a hammer 
to fix every job around the house, you can’t expect a 
relational database to be the right choice for every 
data access application. NoSQL implementations 
solve a data storage problem that relational databases 
weren't really designed to address. So the bottom line 
on NoSQL is that NoSQL isn’t a technology that’s 
going to replace relational database systems such as 
SQL Server—except in very specialized instances. 
SQL Server and other relational databases will remain 
at the core of the vast majority of enterprise data- 
base applications for the foreseeable future. Do you 
agree? What's your take on NoSQL? Td like to hear 
from you. Drop me a line at letters@sqlmag.com or 
motey@sqlmag.com. ፳፲ 
InstantDoc 1D 125778 
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Execute a Stored Procedure 
Within a Query 


EXEC statement. However, you can execute 

a stored procedure implicitly from within a 
SELECT statement, provided that the stored pro- 
cedure returns a result set. The OPENROWSET 
function is key to this technique, which involves three 
steps. 


ኢግ tored procedures are typically executed with an 


Step ፤ 

By default, SQL Server doesn't allow ad hoc 
distributed queries using OPENROWSET. Thus, 
the first step is to enable the Ad Hoc Distributed 
Queries configuration option on the SQL Server 
machine from which you'll be executing the query. 
If the option is disabled on your machine, you can 
use the EnableOption.sql script to enable it. You 
can download this script and the other code dis- 
cussed here by going to www.sqlmag.com, entering 
125824 in the InstantDoc ID text box, clicking Go, 
then clicking the Download the Code Here button. 
After running EnableOption.sql, you can confirm 
that the option is enabled by looking at the script's 
output. It should include the statement Configura- 
tion option ‘Ad Hoc Distributed Queries’ changed 
from 0 to 1. 


Step 2 

The next step is to create a view that provides the same 
result as the stored procedure you want to execute in 
queries. You can then use that view in a SELECT 
statement. I created the sp_ConvProc2View stored 
procedure in Listing 1 to transform stored procedures 
into views. 

As callout A shows, sp_ConvProc2View requires 
two parameters: the name of the stored procedure you 
want to transform into a view (@procName) and the 
name you want to give that view (@viewName). It 
passes those parameters to the OPENROWSET func- 
tion, which it executes with dynamic SQL. Within a 
string, dynamic SQL is slower than ordinary SQL and 
more prone to SQL injection attacks. Because of the 
latter, you need to make sure that any parameters you 
pass to sp_ConvProc2View (and hence OPENROW- 
SET) are properly edited to minimize SQL injection 
attacks. 

As callout B shows, sp_ConvProc2View uses the 
SQLOLEDB provider along with a trusted con- 
nection (Windows authentication) to connect the 
default SQL Server instance (SERVER=.) and run 
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OPENROWSET against it. If you want to use a 
named instance, you need to replace the period with 
the name of the instance or modify the code so that 
the instance's name is provided by a parameter to the 
procedure. The SET FMTONLY OFF statement at 
the end of callout B ensures that the results (and not 
just the metadata) will be output. 

Let's look at a couple of examples of how to use 
sp_ConvProc2View. Note that the view specified with 
the @viewName parameter must not already exist. If 
it does, an error will occur. 

Suppose you want to transform the sp_lock sys- 
tem stored procedure (which provides information 
about locks) into a view named v$Lock. In this case, 
you'd run 


EXEC sp_ConvProc2View 
@procName = 'sp_Tock', 
@viewName ='v$Lock' 


If you want to create a view named v$Session from 
the sp_who system stored procedure (which provides 
information about current users, sessions, and pro- 
cesses), you'd run 


EXEC sp_ConvProc2View 
@procName = 'sp_who', 
@viewName ='v$Session' 


Step 3 
After you create the view, you can use it ina SELECT 
statement. You can make the SELECT statement 
as simple or complex as needed. I'll show you three 
examples that use the v$Lock and v$Session views 
created in step 2. Note that the code in these examples 
is case sensitive. 

Example 1: A simple SELECT statement. If you 
just want to see the locks in all the currently active 
sessions, you can run the statement 


SELECT * FROM V$Lock 


You can add a WHERE clause if you want to filter 
or group data. For example, if you want to see only 
those sessions running a background task, you can 
run the code 


SELECT * FROM v$Session 
WHERE Status LIKE '%background%' 


Eli Leiba 


(iecdba @ hotmail.com) is a senior application 


DBA at Israel Electric Company, a teacher 
at Microsoft CTEC, and a senior database 
consultant for 2Cher IT Professionals. He's 
certified in Orade and SQL Server database 
administration and implementation. 
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Share your SQL Server 
code, comments, discoveries, 
and solutions to problems. 
Email your contributions to 
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include your full name 
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you'll get $100. 
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Example 2: A complex SELECT statement. To 
obtain more in-depth information, you can join views 
in a SELECT statement, thereby implicitly running 
multiple system stored procedures at the same time. 
For example, you can join the v$Lock, v$Session, and 
sys.objects views to obtain detailed information about 
the currently active sessions that contain locks with 
code such as 


SELECT S.status,S.dbname,S.cmd,0.name, 
L.Type,L.Mode,L.Status 

FROM v$Session S JOIN v$Lock L ON 
S.spid = L.spid JOIN sys.objects O 

ON L.ObjId = O.object_id 


For each locked process in a session, this query 
returns: 

e The status of the involved process 

9 The name of the database used by that process 
9 The type of command executing the process 

e The name of object that's locked 

9 The type of lock 

e The lock mode requested 

e The status of that lock request 


Example 3: A SELECT statement whose results 
are loaded into a table. You can use code that creates 
and executes a SELECT statement, then loads the 
SELECT statement's results into an output table. I 
created the sp_OutputAndFilterResults stored pro- 
cedure for this purpose. After creating and executing 
a SELECT statement, it loads the results into a tem- 
porary output table. This stored procedure takes three 
parameters, which are case sensitive: 


LISTING |:sp_ConvProc2View.sql 


CREATE PROCEDURE sp_ConvProc2View 
(@procName varchar (89), @viewName varchar(80)) 


BEGIN 


DECLARE @TSQLStmt nvarchar (599) 
SET NOCOUNT OFF 
(B) SET @TSQLStmt = N'CREATE VIEW ' + @viewName + 


N' AS SELECT * 


FROM ' + 


"OPENROWSET ር ' + '''' + 'SQLOLEDB' + '''' + ',' + 


+ 'SERVER=.;Trusted_Connection=yes' + md + ','+ 


1.0% 4 'SET FMTONLY OFF EXEC ' + @procName + '''' + "ን" 
EXEC sp_executesql @TSQLStmt 
SET NOCOUNT ON 


END 
GO 
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LISTING 2: SampleScript.sql 


EXEC sp_ConvProc2View 
@procName = 'sp_who', 
@viewName ='v$Session' 


EXEC sp_OutputAndFilterResults 
@ViewName ='v$Session', 
@OutputTable = 'output_sess', 
@WhereClause = 'DBname = ''master''' 


GO 
SELECT * FROM output_sess 


9 The name of the view (@ViewName). 

9 The name of the output table (@Output- 
Table). This name needs to be unique. If 
a table by that name already exists, 1t'11 be 
dropped by the stored procedure. 

e A filter condition (@WhereClause). Specifying 
a filter condition is optional. When one isn’t 
specified, @WhereClause is assigned a NULL 
value. 


Using this information, the sp_OutputAndFilter- 
Results stored procedure creates a SELECT state- 
ment, assigning it to the @TSQL variable. Then, if 
there’s a filter condition, the stored procedure adds 
it to the SELECT statement. After using PRINT 
to display the command in @TSQL for debug pur- 
poses, the stored procedure dynamically executes that 
command. 

To execute sp_OutputAndFilterResults, you use 
code such as 


EXEC sp_OutputAndFilterResults 
@ViewName ='v$Session', 
@OutputTable = 'output_sess', 
@WhereClause = 'DBname = ''master''' 
SELECT * FROM output_sess 


This code returns all the currently active sessions that 
are using the master database and stores them in a 
temporary table named output_sess. 


Putting It All Together 
You can combine the code in steps 2 and 3 into one 
script. For example, the script in Listing 2 first uses 
sp_ConvProc2View to transform the sp_who system 
stored procedure into the v$Session view. With the 
help of sp_OutputAndFilterResults, the script then 
uses the v$Session view in a SELECT statement to 
find the currently active sessions that are using the 
master database. It stores that information in the 
output_sess table. This script assumes that: 

e You already enabled the Ad Hoc Distributed 
Queries configuration option. 

e You already ran the code to create the sp_ 
ConvProc2View stored procedure. 

e You already ran the code to create the sp_ 
OutputAndFilterResults stored procedure. 

e You deleted the v$Session object if you ran 
sp_ConvProc2View to transform the sp_who 
stored procedure into the v$Session view in 
step 2. (Remember that the view specified with 
the @viewName parameter must not already 
exist.) 


All the code I've discussed here works on SQL Server 
2008 and SQL Server 2005. SQL 
InstantDoc ID 125824 
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Microsoft Network Monitor 
Capture, view, and analyze information about 


your network traffıc 


any times when I choose to write about a 

SQL Server utility in the Tool Time col- 
umn it’s because a friend or colleague specifically 
points the tool out to me. Other times I write the 
column because the creator, a SQL Server enthu- 
siast, or perhaps a program manager at Microsoft 
reaches out to me to discuss the tool. But in the 
case of this month’s tool, I had a specific need for 
a tool and found that this excellent free tool fit the 
bill perfectly. 

I wanted to see how much network bandwidth 
was consumed specifically by SQL Server answer- 
ing requests from a SQL Server Reporting Services 
(SSRS) application. All of the standard tools, such as 
the PerfMon counters for network utilization, weren’t 
detailed and granular enough to tell me what I needed 
to know. That’s where the Microsoft Network Moni- 
tor comes in. It lets you capture detailed information 
about network traffic, view it, and analyze it. 

In addition, you can download (or even write 
your own) specific parsers, such as the parsers for 
SQL Server, which provide ready-made templates 
to determine how the network is being used in just 
about any and every conceivable context. It’ll answer 
lots of questions about SQL Server network utiliza- 
tion that might otherwise be very hard to answer. 
For example, you can use it to determine how much 
bandwidth a particular web application uses, how 
much traffic moves between SharePoint and SQL 
Server, how much SQL Server traffic is replication 
compared to the total network traffic, or whether the 
reporting front end to an application consumes lots 
of bandwidth. 


Features 

I recommend starting the process of learning about 
Network Monitor at the frequently updated Network 
Monitor blog at blogs.technet.com/b/netmon/. From 
there, you can download Network Monitor from 
the Microsoft article “Information about Network 
Monitor 3” at support.microsoft.com/?kbid=933741 
or directly from TechNet at www.microsoft.com/ 
downloads/details.aspx?displaylang=en&FamilyID 
=983b941 d-06cb-4658-b7f6-3088333d062f#filelist. 
In addition, the parser template that’s specifically 
matched to SQL Server is available at nmpars- 
ers.codeplex.com. Network Monitor's key features 
include 
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e An intuitive and easy-to-use GUI 

e Command-line functionality to better sup- 
port scripting through NMCap.exe (for more 
information, see blogs.technet.com/b/netmon/ 
archive/2006/10/24/nmcap-the-easy-way-to- 
automate-capturing.aspx, with a training 
video available at blogs.technet.com/.../nm3- 
tv-video-help-for-using-nm3.aspx) 

e Concurrent live capture sessions 

* High-performance filtering, which can reduce 


Kevin Kline 


(kevin.kline@ quest.com) is the director 


of technology for SQL Server Solutions 


the overall overhead of Network Monitor at Quest Software and a founding board 
while capturing heavy network traffic loads member of the international PASS. He 

e A script-based parser model with frequent is the author of SQL in a Nutshell, 3rd 
up dates edition (O'Reilly). 


* The ability to monitor network conversations 
between client requests and server responses 
and process tracking 

e Color-rules to differentiate network traffic by 
client, server, application, local traffic, port, 
and more 

e An open framework and API to access the 
capture and parsing engine, plus the ability to 
write your own parsers (you can refer to the 
Network Monitor Expert Analysis Tools at 
go.microsoft.com/fwlink/?LinkID=133950 for 
examples of plug-ins for Network Monitor) 

e Wireless Monitor Mode Capturing 


System Requirements 
Network Monitor needs fairly substantial system 
resources, which is a big reason that Microsoft 


MICROSOFT NETWORK 
MONITOR 


Benefits: Microsoft Network Monitor lets 

you monitor essential aspects of SQL Server 
network consumption, including the traffic 
created by specific backups, applications, and 
database operations. 


System Requirements: 1GHz or faster pro- 
cessor; 1GB or more memory; at least GOMB 
available space on the hard disk; Windows 7, 
Windows Server 2008, Windows Vista, Windows 
XP, or Windows Server 2003 


How to Get It: You can download the Micro- 
soft Network Monitor from support.microsoft 
.com/?kbid=933741. 
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recommends you refrain from running it on pro- 
duction systems. In terms of hardware, minimum 
suggested specifications include a 1GHz or faster 
processor, 1GB or more of memory, and at least 
60MB available space on the hard disk, with much 
more additional hard disk space available to store the 
captured files. Network Monitor runs on Windows 7, 
Windows Server 2008, Windows Vista, Windows XP, 
and Windows Server 2003, and it takes advantage of 
the available features in each. 

Note that if you take the defaults when captur- 
ing a Network Monitor trace, the captured files can 
grow to be quite enormous. (Network Monitor will 
stop capturing data when less than 2 percent of the 
disk remains free.) It can also be memory intensive, 
particularly if you leave the Conversation feature 
enabled, which is the default. 

Network Monitor is well documented and sup- 
ported by an active and enthusiastic community that 
quickly responds to questions asked in the forum. In 
addition, you can find support for this tool on the 
following Microsoft websites: 


* Network Monitor TechNet forum (go 
.microsoft.com/fwlink/?LinkID=151800) 

e Network Monitor blog and landing page (go 
-microsoft.com/fwlink/?LinkID=92890) 

* Network Monitor FAQ and wiki (social 
.technet.microsoft.com/wiki/contents/ 
articles/network-monitor-3-frequently- 
asked-questions.aspx) 

e Channel 9 Network Monitor videos 
(channel9.msdn.com/tags/Netmon/) 

* Network Monitor Open Source Parsers 
(where you can download the latest Micro- 
soft SQL Server parsers) (go.microsoft.com/ 
fwlink/?LinkID=125105) 

* Network Monitor Expert Analysis Tools (go 
-microsoft.com/fwlink/?LinkID=133950) 


As always, Id love to hear your feedback about 
this tool and others on the Tool Time with Kevin 
Kline discussion forum at www.sqlmag.com/go/ 
ToolTimeForum. SQL] 
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Can 1 Restore My Enterprise 
Edition Database on All 
Editions of SQL Server? 


°m designing a disaster recovery strategy for 

SQL Server 2008 and want to make sure I’ve 

got an alternative server to restore my critical 
databases on. I’d prefer to use Standard Edition to 
save on licensing costs, but will it be a problem if Pm 
using Enterprise Edition for the production server? 


Yes, this might be a problem. In SQL Server 2005 
and later, some Enterprise Edition—only features 
make a database Enterprise Edition—only (1.6., it 
can’t be restored on a Standard Edition or lower 
SQL Server system) if the features are present in 
the database. 

In SQL Server 2005, the only feature to which 
this rule applies is table/index partitioning. There’s 
no easy way to tell whether you have partition- 
ing in a database without querying the storage 
metadata to see if any tables/indexes have multiple 
partitions. 

In SQL Server 2008, there are four features to 
which this rule applies: 

1. table/index partitioning 

2. transparent data encryption 

3. change data capture 

4. data compression 


Figure 1 shows an example of an error message 
you'd receive when trying to restore a SQL Server 
2008 database with data compression to a SQL 
Server Express instance using the following code: 


RESTORE DATABASE EnterpriseOnly 
FROM DISK = 'C:\SQLskills\ 


PaulsEnterpriseOnlyDB.bck' 
GO 


Because SQL Server 2008 contains more fea- 
tures that can trip you up, it includes a Dynamic 
Management View (DMV) called sys.dm_db_ 
persisted_sku_features that will tell you if you 
have any of these features in your database. For 
example, if you run the code 


SELECT * FROM sys 
.dm_db_persisted_sku_features; 
GO 


you'll receive the following output, which shows 
you have data compression enabled: 


feature name feature id 


Compression 


Other than data compression, all of the SQL 
Server 2008 R2 and SQL Server 2008 Enterprise Edi- 
tion features that make a database Enterprise Edition- 
only require essentially database owner (DBO) or 
systems admin permissions, but data compression 
requires only the ALTER TABLE permission on the 
table involved. This means a table owner can make a 
database Enterprise Edition-only without the DBA 
realizing it. Therefore, you must make sure that no 
one can enable these features without you knowing 
it if you want to be able to restore your database to a 
non-Enterprise Edition SQL Server instance. 


Msg 3167, Level 16, State 1, Line 1 


Msg 3013, Level 16, State 1, Line 1 
RESTORE DATABASE is terminating abnormally. 
Msg 999, Level 21, State 1, Line 1 


Processed 169 pages for database 'PaulsEnterpriseOnlyDB', file 'PaulsEnterpriseOnlyDB' on file 1. 
Processed 5 pages for database 'PaulsEnterpriseOnlyDB', file 'PaulsEnterpriseOnlyDB_log' on file 1. 


RESTORE could not start database 'PaulsEnterpriseOnlyDB'. 
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Paul §. Randal (paul @ SQLskills.com) 

and Kimberly L. Tripp (kimberly@ 
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world-renowned SQL Server consulting and 
training company. They're both SQL Server 
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SQL Server experience. 


ሠ Editor's Note 


Check out Kimberly and 
Paul's blog, Kimberly € 
Paul: SQL Server Questions 
Answered, at www.sqlmag 
.com/blogs/SQLServer 
QuestionsAnswered.aspx. 


Database 'PaulsEnterpriseOnlyDB' cannot be started in this edition of SQL Server because part or all 
of object 'compressed' is enabled with data compression or vardecimal storage format. Data 
compression and vardecimal storage format are only supported on SQL Server Enterprise Edition. 

Msg 933, Level 21, State 1, Line 1 

Database 'PaulsEnterpriseOnlyDB' cannot be started because some of the database functionality is not 
available in the current edition of SQL Server. 


Figure | 


Error message returned when attempting to restore a SQL Server 2008 database with data compression to a 
SQL Server Express instance 


SQL Server Magazine * www.sqlmag.com October 2010 11 


QUESTIONS 
ANSWERED 


Determining How Long It 
Takes for DBCC CHECKDB 


to Run 


sure that running DBCC CHECKDB will 

be part of my disaster recovery plan. How 
can 1 estimate how long it will take for DBCC 
CHECKDB to run? 


1 f I ever have to perform disaster recovery, Pm 


There's a very simple answer to your question—run 
it and see how long it takes. However, there's obvi- 
ously a lot more to it than that, so let me explain. 
As far as disaster recovery plans go, running 
DBCC CHECKDB is the step you need to take 
before deciding on a course of action, such as 
performing a restore or running a database repair, 


DBCC CHECKDB is extremely resource 


hungry—if the server is heavily loaded, 


DBCC CHECKDB will be competing for 


resources and take longer to run. 


12 October 2010 


because there might not be any data loss or other 
damage to the database that necessitates a drastic 
course of action. The only tool that can tell you 
the extent of the damage to the database is DBCC 
CHECKDB, so you really don’t have any choice 
but to run it. 

Given that you're going to be running DBCC 
CHECKDB, it’s a good idea to know how long 
it’s going to take to run for two reasons. First, if 
it takes longer than usual, that’s a sign that it has 
found corruption and is determining exactly where 
the corruption is. Second, if your boss comes along 
during the disaster recovery process and asks how 
long it will be before you know the extent of the 
damage, it’s far better to be able to say, “Well, it 
usually takes X hours to run” rather than “I have 
no idea, boss—I never run DBCC CHECK DB.” 

As far as estimating how long DBCC 
CHECKDB will take to run on a given database, 
it’s very difficult to tell because there are so many 
variables involved. (I even have a hard time esti- 
mating for large databases, and I wrote DBCC 
CHECKDB)) The following are some factors that 
affect DBCC CHECKDB’s run time: 

e The size of the database. This one’s not so 
obvious—it’s not the size of the database 


that matters, it’s the amount of data that’s 

in it. A 1TB database with only 100MB of 
data can be consistency checked very quickly, 
but if the same database contained 1TB of 
data, it would take a lot longer for DBCC 
CHECKDB to process. 

e The load on the system. DBCC CHECKDB is 
extremely resource hungry—I like to say it’s 
the most resource-intensive operation you can 
run on SQL Server. Therefore, if the server is 
already heavily loaded, DBCC CHECKDB 
will be competing for resources and will take a 
lot longer to run. 

* The capabilities of the system. If the database 
being consistency checked is very large and 
structurally complicated, but the server and/ 
or I/O subsystem are heavily underpowered, 
this will have a knock-on effect on the ability 
of the server to provide the resources DBCC 
CHECKDB needs, slowing it down. 

e The options specified. If the WITH PHYSI- 
CAL_ONLY option is specified, the amount 
of processing that DBCC CHECKDB does is 
drastically cut down, which usually leads to a 
significant reduction in run time. However, I 
wouldn’t recommend using this option during 
disaster recovery. 

e The complexity of the database schema. The 
more features that you use in the database, the 
more structures there are to be consistency 
checked, so DBCC CHECKDB will take 
longer to run. 

e The corruptions that are found. Some corrup- 
tions require deeper reprocessing of data to 
figure out exactly where the corruption is. This 
can lead to a much longer run time for DBCC 
CHECKDB. 

e The tempdb configuration. DBCC CHECKDB 
uses a lot of memory to store intermediate 
consistency checking data, and that storage 
usually spills out to the tempdb database. If 
tempdb isn’t configured well, it can be a bottle- 
neck for DBCC CHECKDB and slow it down. 


As you can see, there are too many factors involved 
to be able to make a good guess. The best way to 
know how long DBCC CHECKDB will take is to 
run it. [SQL] 
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Improve database availability 
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eeping the database available for customer 
use is one of a DBAS primary concerns. 
any technologies are available to assist 
DBAs with availability, both for single-server and mul- 
tiple-server configurations. Database mirroring can be 
a great way to improve database availability because 
it’s fairly easy to deploy, administer, and maintain. In 
this article I explain the fundamentals of mirroring, 
including how to determine whether it's the right 
choice for your environment and how to get a basic 
configuration up and running. Because SQL Server 
2008 R2 doesn’t include any significant enhancements 
to database mirroring, this article applies to both SQL 
Server 2008 R2 and SQL Server 2008. 


Overview 
Database mirroring is one of SQL Server’s high- 
availability options. It’s the newest addition to SQL 
Server’s multiserver family of capabilities, first intro- 
duced in SQL Server 2005. As implied by its name, 
database mirroring lets you keep a remote copy of 
a single database up-to-date with changes from the 
primary copy of your database. The mirroring is 
done one database at a time rather than at the server 
level. Typically, this means it’s much easier to set 
up a mirrored configuration than using other high- 
availability technologies, such as failover clustering. 
The disadvantage of mirroring working at the 
database level is that system databases, such as mas- 
ter, model, tempdb, and msdb, aren’t mirrored (and 
in fact can’t be mirrored). This means logins, SQL 
Server Agent jobs, and other security configurations 


erver 


) 
a R2 


12008 


y 


aren't identical between the two servers. The result is 
that a bit more administrative overhead is necessary 
to make sure customer connectivity works on both 
the servers involved in mirroring; however, you have 
more flexibility in the use of both systems. 

Database mirroring involves a principal server 
and a mirror server, as well as an optional witness 
server. The witness server doesn't participate in mir- 
roring per se—it just helps decide when an automatic 
failover can occur, under very specific circumstances. 
The principal and mirror roles for each database swap 
when a failover occurs. 

The two types of mirroring are synchronous and 
asynchronous. In synchronous mirroring, also called 
full transaction safety, a transaction on the principal 
database isn't considered to be committed until a 
copy of the transaction log record for that transac- 
tion is physically copied over the network to the 
mirror server. Thus, you can guarantee no data loss 
in a synchronous configuration. In an asynchronous 
configuration, also referred to as having transac- 
tional safety turned off, the log records are sent to 
the mirror server but aren't guaranteed to be in sync. 
Thus, some data loss can result if a failure occurs on 
the primary system before the committed transaction 
log records are copied to the mirror server. 

Within synchronous mode, you can select either 
automatic or manual failover. In manual failover 
mode, the administrator can cause a failover to the 
mirror partner database with a simple T-SQL com- 
mand or via the SQL Server Management Studio 
(SSMS) GUI. With automatic failover, you can still 
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perform a manual failover, but if failure conditions 
are detected, then a failover to the mirror partner 
will happen without administrative intervention. 
Automatic mode requires a third server, the witness, 
be involved. A witness server doesn't have a copy of 
the data, unlike the other two servers in a mirrored 
configuration. For an automatic failover to occur, the 
witness must agree with the mirror server that they're 
both unable to make a client connection to the data- 
base on the principal server. If they both agree that 
the principal database server is unavailable, the data- 
base on the mirror server is recovered and becomes 
available for client connections, thereby acting as the 
principal server. 

Although synchronous mirroring requires SQL 
Server Standard Edition or better, a witness can run 
on any edition of SQL Server. However, if the wit- 
ness server fails for some reason, both the primary 
server and the mirror server must be running. If one 
of them fails, the database will become unavailable 
until you manually intervene. Randomly selecting a 
noncritical server to serve as the witness for a data- 
base mirror configuration isn't a good idea, because 
your availability is only as good as your witness 
server. 

Before we dig any further into mirroring, I want 
to stress a couple of configuration requirements. As 
I stated previously, SQL Server 2008 R2 Standard, 
Enterprise, or Datacenter is required for synchronous 
mirroring (high safety mode). Both the principal 
server and the mirror server must be running the 
same edition of SQL Server. SQL Server 2008 R2 
Enterprise Edition or higher is necessary for asyn- 
chronous mirroring. The database you're mirroring 
must be in full recovery mode and mustn't be using 
the FILESTREAM feature. 

Although your mirror server can act as a backup 
for your principal mirrored server, you should have 
the same hardware resources (memory, CPU, disk 
subsystem) for both servers. The mirror server 
requires roughly the same amount of space for the 
database as the principal server, and in the event of 
a failover the mirror server becomes the production 
machine. Service level agreements (SLAs) for this 
database might or might not allow for degraded 
performance after a failover--you must make that 
determination when you size the hardware. 


Design and Configuration 

The first question to ask is whether database mir- 
roring is a good fit for the availability goals you 
established for your database. Mirroring can be a 
good fit if you need only a single database and if 
your entire schema is contained in one database. But 
if your design includes parts of the logical schema in 
two or more physically separate databases on your 
server, mirroring probably isn't the best solution. In 
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that case, you'd want a solution that keeps multiple 
databases at a time highly available—failover clus- 
tering is the only such technology that's built in to 
SQL Server. 

If your entire logical schema is in a single data- 
base, the next thing to consider is whether you need 
synchronous or asynchronous mirroring. Can your 
business afford the possibility of data loss? If not, you 
should use synchronous database mirroring. 

An additional consideration for synchronous 
mirroring is the network latency between the two 
mirrored servers. The more latency that exists between 
the two servers, the more the principal transaction 
server's performance will be affected (because no 
commit is final until the log records are physically 
secured on the mirror server). Therefore, a high-speed 
network connection is essential between the principal 
and mirrored servers for synchronous mirroring to be 
a viable solution. 

As a side note, database mirroring in asynchro- 
nous mode could be handy for secondary availability 
protection or for disaster recovery. If you have a 
local failover cluster (perhaps with some sort of SAN 
mirroring for the physical data files as well), then 
you might consider asynchronous log shipping to a 
remote site for additional protection or for disaster 
recovery purposes. Some risk of data loss still exists— 
but in an extreme scenario in which an entire data 
center becomes unavailable, a small amount of data 
loss might be acceptable. 


Security Considerations 

You need to keep several security considerations 
in mind when configuring database mirroring. The 
first thing to consider is that under the hood, Service 
Broker endpoints are used to establish database 
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Setting the AdventureWorks database to full recovery mode 
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Figure 2 
Using the NORECOVERY option during an SSMS restore 


File 


Q anageme dio 
Edit View Debug Tools Window Community Help 


i 2. New Query | D AAA 


a 


rudustn 1Yogshipprimary (SQL Server 10,50. 1600 
Databases 


Adventure New Database... 
Adventure] New Query 
[8 [Y Adventure 
Ant Script Database 552. ኑ 
BI [Y MirrorDB Tasks a በጩ-- 
Teamcen! 
El [| timekeepir Policies ፦ 
E [A Security Ft 
El [A Server Objects Û E 2 
E] Replication Start PowerShell > 
[3 Management 
El 05 SQL Server Ag Reports ፦ Back Up... 
በ Restore » 
Delete Mirror... 
Refresh Launch Database Mirroring Monitor... 
Properties Ship Transaction Logs... 
Generate Scripts... 
Extract Data-tier Application... 
Register as Data-tier Application... 
Import Data... 
Export Data... 
4 ›| 
Copy Database... 
Ready አ 
Manage Database Encryption... 
Figure 3 


Launching database mirroring 
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nfigure Database Mirroring Security Wizard 


Principal Server Instance 


Specify information about the server instance where the database was originally 
located. 


Principal server instance: 
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Specify the properties of the endpoint through which the principal server instance will accept 
connections from the mirror and witness server instances: 


Listener port: IV Encrypt data sent through this endpoint 
int name: 


Mirroring 


NOTE: If the principal, mirror or witness are instances on the same server, 
their endpoints must use ports. 


Hep | <Back [[ Next> | tus | Cancel Û 
Figure 4 
Con 


figuring the principal server instance 


mirroring connectivity. Each of these endpoints 
has a TCP port assigned, and you must ensure that 
your network and firewall allow communication to 
occur on those ports. If you create the endpoints 
yourself (using the CREATE ENDPOINT T-SQL 
statement), select the TCP listener port. If you cre- 
ate the endpoints via SSMS, an unused TCP port 
15 suggested for use. You must also specify whether 
or not you want all mirroring transmissions to be 
encrypted. You'll trade off security for more CPU 
to perform the encrypt/decrypt operations on either 
side of the mirroring session. SSMS encrypts the 
connection by default. 

Another security consideration is that the servers 
must be able to establish a secure communication 
channel. The best solution is if both servers are in 
the same domain or in domains that have a trust 
relationship. If so, you can simply use Windows 
authentication to move the data. If your servers aren't 
in a valid domain relationship and you still want to 
mirror the database to the remote server, you must use 
certificates to establish the mirror session's logins and 
encryption. For more information about how to use 
certificates in this scenario, see “Setting Up Database 
Mirroring Using Certificates (Transact-SQL)” (msdn 
.microsoft.com/en-us/library/ms191140.aspx). Keep 
in mind that if your database clients are using Win- 
dows authentication, they won't be able to connect 
to the mirror server, either. Note also that if you use 
a witness server, the witness server must be able to 
make secure connections to both the principal server 
and the mirror server. 

Logins, linked servers, etc., are stored in the master 
database. Because database mirroring mirrors only a 
single database, none of the contents of the master 
database automatically appear on the mirror server. 
When you set up mirroring, you must ensure that the 
relevant logins for your database clients also exist 
on your mirror server. Any additional SQL Server 
components that are used outside your database must 
also be manually copied to the mirror server (e.g., 
linked servers, CLR objects, jobs, alerts, database 
mail configurations). Many ISV applications don't 
have any of these dependencies and are therefore 
great candidates for simple mirroring. But if you do 
take advantage of these features in your applications, 
be sure you account for them when you establish 
database mirroring. 


Initializing Mirroring 

After you have the security configuration in place 
for your mirroring session, you must move a copy 
of your database from the principal server to the 
mirror server. The easiest way to do so is to back up 
the principal database, then use the NORECOVERY 
option to restore that database on the mirror server. 
If you perform any log backups after the database 
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backup, you must also restore those transaction log (HA N EN TED NEY DITE 
backups on the mirror server. When you're done, the Service Accounts 

3 3 Specify the service accounts of the server instances. 
mirrored server's database should be recovered to as e i 


close to the current time as possible so that the log 


For SQL Server accounts in the same domain or trusted domains, specify the service accounts 


records still exist on the principal server's copy of below. if the accounts are non-domain accounts or the accounts are in untrusted domains, 
leave the textboxes empty. 


the database. 

To illustrate database mirroring, let's step through 
the SSMS GUI and establish a mirrored database Service accounts for the following instances: 
for SQL Server. 11 use the AdventureWorks sample 
database in my example; you can download this data- 
base at msftdbprodsamples.codeplex.com/releases/ 
view/45907. My example is for a synchronous mir- በር DEK ኣን wines ረ 
roring session with a witness (i.e., automatic failover and will be granted CONNECT pemission on the endpoints. 
allowed). The AdventureWorks database is in simple 
recovery mode by default, so my first step is to switch 
1t to full recovery mode, as Figure 1 shows. 

Next, you need to create a full backup of the e | sek |[ ne> | mm | ርመ› | 
AdventureWorks database and copy that backup 
to the server you want to use for your mirror. This 
process might take a while for a real database, which 
is likely to be significantly larger than the Adventure- 
Works database. Remember to use the NORECOV- 
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will launch on the mirroring tab. Click Configure 
Security to start the Configure Database Mirroring 
Security Wizard. Click Next, specify that you want to Figure 6 
configure a witness server, and click Next again. The 
dialog box in Figure 4 launches, to let you configure 


Start Mirroring Do Not Start Mirroring 


Starting the mirroring process 


the principal server instance. Database Properties 
Notice that you can change the port number, but y | GES See eT ea 
ጅማ © Additional information: 
SSMS attempts to select a port that isn’t in use. You L> Alter failed for Database 'AdventureWorks'. (Microsoft.SqlServer.Smo) 
can also change the endpoint name. Encryption is A ሙር 
enabled by default. Click Next to launch the dialog JR SE a የ ረ Kan መ cenan EE 
$ . local and ir tional. ft Server, Error: 1418) 
box that lets you select your mirror server instance EE De ው 1418) 


(which looks very similar to Figure 4). You can a 9: Cox] 
configure the port, encryption, and endpoint name. 
As long as you're using two different servers for the 
configuration, you can use the same listener port | Configuration error dialog box 

and endpoint name on both sides of the connection. 

Click Next to launch the dialog box that lets you con- LISTING I: T-SQL Code to Create the 


figure the witness server. Click Next again to specify Database Mirroring Endpoints 


Figure 7 


the service accounts for each instance of SQL Server, -- Run on the principal server 
1 1 > CREATE ENDPOINT Mirroring 
as Figure 5 shows. If the service accounts aren t the Sete = SAD 
same on each instance in your mirror configuration, AS TCP ር LISTENER_PORT = 5922 ን 
you can specify for SSMS to add logins to each POR DATABASE MIRRORING ከ ARTNER) 
instance on each server. However, using the same -- Run on the mirror server 
E . NÛ CREATE ENDPOINT Mirroring 
service account for each instance of SQL Server is STATE = STARTED 
Meal AS TCP ር LISTENER_PORT = 5922 ን 
recommended to keep the security simple, FOR DATABASE MIRRORING (ROLE=PARTNER) 
Click Next to obtain a summary dialog box of the 
4 : -- Run on the witness server 
actions to be performed on your behalf, then click CREATE ENDPOINT Mirroring 
Finish to run the appropriate T-SQL commands. The STATE = STARTED 
ê . me AS TCP ር LISTENER PORT = 5922 ን 
T-SQL code that’s generated will look similar to the FOR DATABASE_MIRRORING (ROLE=WITNESS) 


code in Listing 1. 
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DATABASE MIRRORING IN SQL SERVER 2008 R2 


LISTING 2: Code to Configure Login Security and 


Start Mirroring 


-- Run on all three servers. 
-- This assumes everyone is using the same service account. 
-- If not, create a login and grant connect privileges for each account 


used on every machine. 
Use Master 


CREATE LOGIN [RWDOMAIN\SQLSvc] FROM WINDOWS 


GO 


GRANT CONNECT ON ENDPOINT::Mirroring TO [RWDOMAIN\SQLSvc] 


GO 


-- Run on the mirror server. 

-- This identifies the principal server. 

ALTER DATABASE AdventureWorks SET PARTNER = 

"TCP: //RWClustN1. rwdomain. richardwaymire.com:5822' 


-- Run on the principal server. 

-- This identifies the mirror server. 

ALTER DATABASE AdventureWorks SET PARTNER = 

"TCP: //RWClustN2.rwdomain. richardwaymire.com: 5822" 


-- Run on the principal server. 

-- This identifies the witness server. 

ALTER DATABASE AdventureWorks SET WITNESS = 

"TCP: //SQLClust1. rwdomain. richardwaymire.com:5922' 


PÛ Database Mirroring Monitor 


Figure 8 


rwclustrı1\lo. 
RWCLUST.. 


Principal @ Synchronized 


Mirror @ Synchronized @ Connected 


Unsent log: j0 KB 
Oldest unsent transaction: [00:00:00 


Unrestoredlog: 8፻8 TT 
Time To restore log 
(estimated): 


Time to send lı 
(estimated) : û Current restore rate: |0 KB/sec 
Current send rate: 


Current rate of new 
transactions: 


Principal log (8/4/2010 5:25:11 PM) Mirror log (8/4/2010 5:25:20 PM) 


Mirror commit overhead: 


fo miliseconds 
Time to send and restore all current 


log (estimated) 
Witness address: 
Operating mode: 


[TCP://nwsqldemo rwdomain sichardwaymire.com:5022 
High safety with automatic failover (synchronous) 


Database Mirroring Monitor 


18 October 2010 


After the T-SQL code runs, the Database Proper- 
ties dialog box launches, as Figure 6 shows. To begin 
mirroring, click Start Mirroring. If you made a con- 
figuration error, you'll get an error dialog box such as 
the one in Figure 7. Examples of configuration errors 
include making a typographical mistake when enter- 
ing the server configuration, neglecting to specify in 
the wizard that you aren’t using the same domain 
account on every instance of SQL Server, and forget- 
ting to open the appropriate TCP port in Windows 
Firewall. After you grant firewall exception on all 
three servers, click OK, then click Start Mirroring. 
After the endpoints are created (Listing 1), logins 
are created and connect privileges are granted if they 
don’t already exist for each instance (Listing 2). 

To verify that mirroring is working properly, 
start the Database Mirroring Monitor. Right-click 


TABLE |: Database Mirroring 
Client Connectivity Keywords 
by Technology 


OLE DB FailoverPartner 
ODBC Failover_Partner 
ADO Failover Partner 


(Table information is from “Making the Initial Connection to a Database 
Mirroring Session,” msdn.microsoft.com/en-us/library/ms366348.aspx.) 


AdventureWorks, then select Tasks, Launch Data- 
base Mirroring Monitor. You'll see a window similar 
to Figure 8, which shows that the mirror is in sync. 
If something were amiss, you could view the problem 
and begin troubleshooting steps from this monitor. 
You can also select information from the sys.data- 
base_mirroring catalog view to query the state of 
mirroring. 


Client View 

For the most part, client configuration for mirroring 
is the same as for connecting to any other SQL Server 
instance. You need to ensure that you're using the Data- 
base attribute to switch to the correct mirrored data- 
base (in our case, AdventureWorks). You add a new 
parameter in your connection string, depending upon 
your API (see Table 1). To connect using managed 
code, modify your connection string and set the server 
to something like Server=RWCLUSTN1; Failover 
Partner=RWCLUSTN2; Database=AdventureWorks; 
Network=dbmssocn. In the event of a database 
failover, the client application automatically switches 
to the mirror partner. If you can’t modify the cli- 
ent’s connection strings and you still want a high- 
availability solution, you'll probably have to resort to 
failover clustering. 


Database Mirroring Is Easy 

Once you have the proper security and server 
configuration in place, database mirroring is fairly 
straightforward. You have to bounce around a bit 
in terms of running commands on each of the three 
servers (in the case of a highly safe synchronous 
mirroring solution with a witness), and you have 
to jump back and forth with the initial database 
backup and restore. But after you accomplish those 
tasks, it’s easy to get a mirrored database off the 
ground. 

For more information and details about data- 
base mirroring, see “Database Mirroring” (msdn 
.microsoft.com/en-us/library/bb934127.aspx). For an 
advanced discussion of database mirroring best 
practices, see “Database Mirroring Best Practices 
and Performance Considerations” (technet.microsoft 
.com/en-us/library/cc917681.aspx). ፳፲ 
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Based on 


Filters 
Number of Rows 


and Logical Ordering 


TOR FETCH FIRST, and TOP OVER let you 
write better I-SOL code 


ast month I started a two-part series about 
EE filters. I discussed filters based on 

predicates, including filters that T-SQL 
supports, such as ON, WHERE, and HAVING, 
and filters that T-SQL doesn't support but that 
are defined by standard SQL or other database 
platforms, such as the standard FILTER clause for 
aggregates and the QUALIFY clause that Teradata 
supports. This month I focus on a different kind of 
query filter—one that isn’t based on a predicate, 
but rather on a requested number of rows and 
logical ordering. 

As a reminder, in my examples I use a sample 
database called Inside TSQL2008. You can down- 
load the source code for this sample database 
from www.InsideTSQL.com/books/source_code/ 
Inside TSQL2008.zip. 


TOP 

The TOP filter is a nonstandard filter that was added 
in SQL Server 7.0 to address a common filtering 
need—the need to filter rows based on some logical 
ordering and a requested number (or percent) of rows. 
You specify the requested number of rows right after 
the TOP clause, and the ordering is defined by the 
query’s traditional ORDER BY clause. For example, 
to return the three orders with the highest order val- 
ues, ሃዐ በ use the following query: 


USE InsideTSQL20B8; 


SELECT TOP (3) orderid, orderdate, 
custid, empid, val 

FROM Sales.OrderValues 

ORDER BY val DESC; 


On the surface, the TOP feature's design seems 
straightforward. It's clear that this query will return 
the three orders with the highest order values, and 
it seems obvious that the query will return those 
rows from highest to lowest order values in terms of 
presentation ordering. But in practice this design is a 
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source of much confusion and trouble. To see why, we 
need to go back to T-SQL's foundations. 

T-SQL is based on standard SQL (both ISO 
and ANSI standards), which in turn is based on 
the relational model, which in turn is based on two 
mathematical branches—set theory and predicate 
logic. Last month's discussion revolved around 
predicate-based filters, which gave us a glimpse of 
the relevance of predicate logic to T-SQL. As for set 
theory, understanding some of the key concepts from 
this mathematical branch can give you better insight 
to the meaning of the code you write using T-SQL. 

What's most relevant to our discussion is the fact 
that there's no order to the elements of a set. This 
concept might sound simple, but it's a great source 
of confusion in writing T-SQL code. A table in a 
SQL Server database represents a relation from the 
relational model, which in turn represents a set from 
set theory. You need to keep in mind that in logical 
terms there's no order to a table's rows—never mind 
what you know about the physical layer that imple- 
ments the table (e.g., indexes). Similarly, the result of 
a query without a presentation ORDER BY clause is 
a table result; therefore, there's no guarantee for the 
order of the rows in the output. If you add a presenta- 
tion ORDER BY clause to a query, the nature of the 
result is inherently different than without one—you 
get what standard SQL conceptually considers to be 
a cursor. The idea behind a cursor is that it represents 
a result with guaranteed order, meaning that 1t's 
nonrelational. 

If you're wondering why you should care whether 
the result is conceptually a relation or a cursor, you 
should note that SQL Server doesn't typically let 
you define a table expression, such as a view, inline 
table-valued function, derived table, or common 
table expression (CTE), based on a query with an 
ORDER BY clause. For example, if you try defining 
the following view: 


CREATE VIEW Sales.MySortedOrders 
AS 
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SELECT orderid, orderdate, custid, 
empid, val 

FROM Sales.OrderValues 

ORDER BY val DESC; 

GO 


you get the error in Figure 1. 

The error message doesn't go into a great detail 
of explanation but simply says the ORDER BY 
clause isn't allowed. The reason for disallowing an 
ORDER BY clause in the view's query is that a view 
is supposed to represent a table; a table is supposed 
to represent a relation; a relation is supposed to 
represent a set; and a set is unordered. As an aside, 
Oracle supports the concept of ordered views, which 
is one of the reasons people try to define them in 
SQL Server. But you should realize that support for 
such a construct is in complete contradiction with the 
relational model. 

Going back to TOP, this feature's design doesn't 
let you indicate the logical ordering for the TOP filter 
separately from and independently of presentation 
ordering; instead, it overloads the existing ORDER 
BY clause that was designed originally just for pre- 
sentation ordering purposes with this extra meaning 
for TOP. The two orders are therefore intertwined 
and inseparable. And what if you want to use TOP 
in a query that's used to define a table expression? 
You need an ORDER BY clause to give meaning to 
the TOP filter. But there's a conflict with the restric- 
tion that disallows ORDER BY in table expressions. 
The way SQL Server's designers resolved this conflict 
was by making an exception regarding ORDER BY 
in table expressions when TOP is specified. So for 
example, the following is allowed: 


CREATE VIEW Sales.MyTop30rders 
AS 


SELECT TOP (3) orderid, orderdate, custid, 
empid, val 


Msg 1833, Level 15, State 1, Procedure MyOrders, Line 6 


FROM Sales.OrderValues 
ORDER BY val DESC; 
GO 


Because SQL Server allows an ORDER BY clause 
with TOP in table expressions, some people try to 
abuse this fact and create a “sorted” table expression 
by using TOP (100) PERCENT—for example: 


CREATE VIEW Sales.MySortedOrders 
AS 


SELECT TOP (100) PERCENT orderid, 
orderdate, custid, empid, val 

FROM Sales.OrderValues 

ORDER BY val DESC; 

GO 


But here's the part many people aren't aware of—in 
accord with the relational model, a query against a 
table expression doesn't guarantee presentation order- 
ing unless the outermost query has a presentation 
ORDER BY clause. Presentation ordering is guaran- 
teed only in the immediate level, but not in outer levels 
of the code. So when you issue the following query: 


SELECT * FROM Sales.MySortedOrders; 


you don't get a guarantee for presentation ordering. You 
might or might not get the data in val DESC ordering-- 
there's no guarantee. For example, when I ran this code, 
I got the data in the order shown in Figure 2. 

When the TOP query is the outermost query, the 
ORDER BY clause serves both the presentation and 
TOP filtering-related purposes. For example, consider 
the following query: 


SELECT TOP G) orderid, orderdate, custid, 
empid, val 

FROM Sales.OrderValues 

ORDER BY val DESC; 


The ORDER BY clause is invalid in views, inline functions, derived tables, subqueries, and common table 


expressions, unless TOP or FOR XML is also specified. 


Figure ፤ 
Error message from ORDER BY query 


orderid orderdate custid 

19248 2006-07-04 00:00:00.000 85 

19249 2006-07-05 00:00:00 .000 79 

19259 2006-07-08 00:00:00 .000 34 

19251 2006-07-08 00:00:00 .000 84 

18252 2006-07-09 99: 9: 09.999 76 
Figure 2 


Data result from SELECT query 


empid val 


5 440.00 
6 1863.40 
4 1552.68 
3 654.96 
4 3597390 
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Here you're guaranteed both to get the three orders 
with the highest order values and that they'll be pre- 
sented from highest to lowest values. 

Besides this confusing aspect of the TOP design, 
there's also the fact that because TOP doesn't have its 
own ordering specification that's separate from the pre- 
sentation ordering, you can't directly filter with TOP 
using different ordering than the presentation one. If 
you wanted to filter the three orders with the highest 
order values and present them in order date ordering, 
you'd need to use a table expression, like so: 


WITH C AS 
( 
SELECT TOP (3) orderid, orderdate, 
custid, empid, val 
FROM Sales.OrderValues 
ORDER BY val DESC 
) 
SELECT * 
FROM C 
ORDER BY orderdate; 


Now that the TOP query is used in a table expression 
definition, that query's ORDER BY clause serves 
only the TOP related ordering, whereas the outer 
query's ORDER BY clause serves the presentation 
ordering. 

SQL Server 2005 introduced support for TOP 
with modification statements, which led to even more 
trouble because the design of the TOP feature relies 
on the presentation ORDER BY clause to also give 
meaning to the TOP filtering, and because modifica- 
tion statements don't have a presentation ORDER 
BY clause. The solution was to simply not allow an 
ORDER BY clause. So you can add TOP to modifi- 
cation statements—for example: 


DELETE TOP (5@) FROM Sales.Orders; 


However, because you can't indicate an ORDER BY 
clause, you can't really control which 50 rows will 
be deleted—it’s a pure matter of optimization. SQL 
Server will delete the 50 rows it physically happens to 
stumble onto first. I’m not saying there are no uses 
for this capability with modifications, but the uses 
are limited. 

You can use a table expression as a workaround. 
For example, to delete the 50 oldest orders, use: 


WITH C AS 
( 
SELECT TOP (58) * 
FROM Sales.Orders 
ORDER BY orderdate, orderid 
) 
DELETE FROM C; 
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Of course, if the TOP clause supported its 
own ordering specification, there wouldn't be 
any confusion and you wouldn't need to use a 
workaround. 

One final thing that's still missing from the 
TOP design, other than supporting its own order- 
ing, is the ability to define partitioning. For 
example, it would be nice if you could request the 
three orders with the highest order values for each 
customer. 


FETCH FIRST 

1 kept hoping that if standard SQL ever came up 
with a design for a filter based on number of rows 
and logical ordering, it would be one that supported 
its own ordering specification, as well as a partition- 
ing concept. The SQL Server 2008 standard indeed 
introduced a new filtering clause called FETCH 
FIRST that's based on number of rows and logical 
ordering, but it's very similar to TOP. 

The FETCH FIRST clause also overloads the 
existing ORDER BY clause with this extra filtering- 
related meaning instead of supporting its own 
ordering specification. Like TOP, FETCH FIRST 
doesn't support a partitioning concept. In terms of 
syntax, you specify it after the ORDER BY clause, 
like so: 


SELECT orderid, orderdate, custid, 
empid, val 

FROM Sales.OrderValues 

ORDER BY val DESC 

FETCH FIRST 3 ROWS ONLY; 


Because this filtering clause is tied to the traditional 
ORDER BY clause, the standard had to legalize 
the use of ORDER BY in table expressions when 
the FETCH FIRST clause is also specified. If 
doubt ever existed regarding guarantees for pre- 
sentation ordering when a table expression query 
has an ORDER BY clause but the outer query 
against the table expression doesn't, here's what the 
standard has to say: 

“A derived table is a table derived directly or 
indirectly from one or more other tables by the 
evaluation of an expression, such as a <query 
expression> or <table expression>. A <query 
expression> can contain an optional <order by 
clause>. The ordering of the rows of the table 
specified by the <query expression> is guaranteed 
only for the <query expression> that immediately 
contains the <order by clause>.” 

Unless the outermost query has an ORDER 
BY clause, no presentation ordering guarantee 
exists. Even though the behavior is well defined by 
the standard, you need to be aware of it to know 
what to expect. This design is just as confusing as 
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TOP--even if SQL Server adds support for it in 
the future, users will have the same trouble and 
confusion as with TOP. Most people won't know 
what the standard says about presentation ordering 
guarantees and will therefore keep trying to abuse 
such features to create illogical constructs such as 
sorted views. 


QUERY FILTERS 


TOP OVER 


What kind of design for a filtering clause based on 
number of rows and logical ordering wouldn't be 
confusing? The standard addressed similar needs 
with window functions, designing them with an 
OVER clause in which you can indicate partition- 
ing and ordering that's specific to the window 
function. So why not design the filtering clause 
similarly? For example, imagine you could use the 
following query to return the three orders with the 


highest values for each customer: 


SELECT TOP (G3) OVERCPARTITION BY custid 
ORDER BY val DESC) 
orderid, orderdate, custid, 
empid, val 
FROM Sales.OrderValues; 


Wouldn't that be great? There's no confusion 
between the filter-related ordering and presenta- 
tion ordering, and you can define partitions. In 
addition, this design lets you define filter-related 
ordering with modification statements. 

Perhaps the filtering clause should use a dif- 
ferent keyword than OVER, because OVER has 
a very specific meaning in the context of win- 
dow functions. A better alternative might be the 
following: 


SELECT TOP (3, PARTITION BY custid 
ORDER BY val DESC) 
orderid, orderdate, custid, 
empid, val 
FROM Sales.OrderValues; 


But the idea remains the same—no ambiguity; no 
confusion. If you think this feature would be useful 
in SQL Server, you can vote for it at the Microsoft 
Connect page (connect.microsoft.com/SQLServer/ 
feedback/details/254390). 

In the meantime, you can get similar func- 
tionality by using the ROW_NUMBER function 
like so: 


WITH C AS 
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( 
SELECT ROW_NUMBERO OVERCPARTITION 
BY custid ORDER BY val DESC) 
AS rownum, 
orderid, orderdate, custid, 
empid, val 
FROM Sales.OrderValues 
) 
SELECT * 
FROM C 
WHERE rownum <- 3; 


Of course, the outer query isn't limited to being 
a SELECT query. It can also be a DELETE or 
an UPDATE statement. 


Write Better Code 
Filtering is a common and fundamental need 
when querying data in SQL Server. You'd think 
that filtering would be a simple and straightfor- 
ward task, but as I discussed last month and this 
month, filtering is actually a deep and involved 
subject. I described both predicate-based filters 
and filters based on number of rows and logical 
ordering, and I described filters that SQL Server 
supports, as well as filters 1t doesn't support. 
To write better code, you must first understand 
T-SQL’s foundations, and you need to appreciate 
the subtleties and complexities of filters. [SQL] 
InstantDoc 1D 125766 
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SQL Server 


Virtualization 
and Availability 


Be more efficient and get more uptime through 


virtualization 


irtualization can make your IT infrastruc- 
V more efficient and flexible. One of 

the most common uses for virtualization 
is in the area of server consolidation. Virtualiza- 
tion lets you run multiple server instances on a 
single hardware platform, making more efficient 
use of your existing servers. This results in higher 
hardware resource utilization, lower power require- 
ments, and fewer operational requirements. How- 
ever, virtualization has other benefits beyond server 
consolidation—it can also improve the availability 
of your servers. In this article you'll learn about 
some of the ways that virtualization can be used to 
improve the availability of SQL Server systems in 
the areas of disaster recovery, planned downtime, 
and dynamic resource management. Pll focus on 
the options that are available through the Microsoft 
technology stack, but similar options are available 
from VMware. 


Backing Up Virtual Machines 
for Disaster Recovery 

One of the most basic ways that virtualization can 
improve SQL Server availability is in the area of 
disaster recovery. Virtualization abstracts the server 
OS from the underlying hardware. This separation 
provides a lot of flexibility in how you can back up 
and restore a server in the event of a hardware failure 
or site outage. 

There are two basic options for backing up 
Hyper-V virtual machines (VMs). You can back 
up at the Hyper-V virtualization host level or the 
Windows Server guest OS level. (It's important to 
remember that these system backups aren't substi- 
tutes for your regular database backups.) Backing 
up a VM at the guest level is essentially the same 
as a normal backup of a physical server. Windows 
Server Backup is able to use Volume Shadow Copy 
Services (VSS) to take live backups of SQL Server. 
You can restore either at the volume level or at the 
individual file level. 
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You can back up at the guest level, but virtualiza- 
tion also lets you perform VM backups at the host 
level. Windows Server's VSS enables you to back 
up running VMs with no downtime. One of the big 
advantages of backing up at the virtualization host 
level is the fact that the saved VMs can be quickly 
restored to another Hyper-V host. There's no need 
for bare metal restores or reinstalls of the OS, SQL 
Server, or its databases—the VM backup contains all 
of these. SQL Server and its databases can be avail- 
able as quickly as the VM restore can be completed 
and attached to another Hyper-V host. 

To take advantage of online host backups, the 
VM host OS needs to be VSS-aware and the Hyper-V 
Integration Services must be installed in the VM. 
Windows Server 2008 R2, Windows Server 2008, 
Windows Server 2003 R2, and Windows Server 2003 
all support VSS backups. VMs running guest OSs 
that don't have support for VSS backups, such as 
Windows 2000 Server or VMs without Integration 
Services installed, will save their state when the VSS 
snapshot is taken. The state will be restored after the 
snapshot is complete. 

Hyper-V is VSS-aware, but VSS compatibility 
for Hyper-V isn't enabled by default. For Windows 
Server Backup to be able to back up a Hyper-V 
VM using VSS, you need to add the registry key 
HKLM\Software\Microsoft\Windows NT\Cur- 
rentVersion\WindowsServerBackup\Application 
Support\{66841CD4-6DED-4F4B-8F17-FD23- 
F8DDC3DE}. After adding the key, you need to 
create the following string value under it: 


Name: Application Identifier 
Type: REG_SZ 
Value: Hyper-V 


For more information about Hyper-V and Windows 
Server Backup integration, refer to the Microsoft 
article “How to Back Up Hyper-V Virtual Machines 
from the Parent Partition on a Windows Server 
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2008-Based Computer by Using Windows Server 
Backup” at support.microsoft.com/kb/958662. 

For disaster recovery purposes, you'd typically set 
up a process to move the VM backup to a disaster 
recovery site. You can do this manually or by using 
File Replication Services, Distributed File Services 
Replication, or even scripted FTP transfers. For more 
information about using DFS for disaster recovery, 
see “Disaster Recovery with DFS,” InstantDoc ID 
39768. 

To restore a VM, simply run Windows Server 
Backup and select the Recover option from the 
Actions Pane. When performing the restore, select 
Application Restore and choose Hyper-V. This will 
display the VMs that can be restored. 

As an alternative to building a homegrown 
solution for disaster recovery, there are also several 
add-on products that facilitate disaster recovery for 
Hyper-V VMs, such as Microsoft’s Data Protection 
Manager 2010, AppAssures’s Replay Replication, 
CA ArcServe Replication, Double-Take Software’s 
Double-Take Availability, EMC’s Symmetrix Remote 
Data Facility, R1Soft’s Continuous Data Protection, 
and SteelEye DataKeeper for Hyper-V. 


Using Live Migration to Reduce 
Planned Downtime 

Live Migration, the equivalent of VMware’s 
VMotion, fills an important hole in Micro- 
soft’s Hyper-V virtualization platform. It lets 
you move VMs between Hyper-V hosts with 
no downtime. Live Migration lets you perform 
planned maintenance on Hyper-V hosts without 
incurring any end-user downtime from the appli- 
cations that use the database servers on those 
VMs. Using Live Migration, you can move all 
the VMs from the physical Hyper-V host that 
needs maintenance to another Hyper-V host. 
You can perform whatever hardware or software 
maintenance that the host requires, then migrate 
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the database server VMs back to the original 
server when you're done. 

Live Migration can work with any version of SQL 
Server, but it requires Windows Server 2008 R2 or 
Hyper-V Server 2008 R2 as the virtualization host. 
In addition, the Hyper-V servers must be nodes in a 
Windows Failover Cluster. Clusters can have up to 16 
nodes and Live Migration can take place between any 
of the nodes in the cluster. You can see an overview 
of how Live Migration works in Figure 1. 

Live Migration takes place between two Hyper-V 
hosts. Essentially, the VM configuration and memory 
of the source VM is copied to a target Hyper-V host. 
After the memory is copied to the target Hyper-V 
host, the VM on the new host can access its VHD 
files, which are stored in a Cluster Shared Volume 
(CSV), and the VM can continue to run without 
interruption. Server 2008 R2' new CSV storage lets 
both Hyper-V hosts simultaneously access the stor- 
age where the VM's Virtual Hard Disk (VHD) files 
are located. 

When you initiate a Live Migration, the following 
steps occur: 

1. A new VM configuration file is created on 
the target server. 

2. The source VM’s initial memory state is 
copied to the target. 

3. Changed memory pages on the source VM 
are tagged and copied to the target. Step 3 contin- 
ues until the number of changed pages is small. 

4. The VM is paused on the source node. 

5. The final memory state is copied from the 
source VM to the target. 

6. The VM is resumed on the target. 

7. An ARP is issued to update the network 
routing tables. 


Live Migration works exactly as advertised. You can 
run multiple queries on a SQL Server system that’s 
being live migrated to another Hyper-V host and 
there'll be no interruption of service—the queries will 
run to completion with no problems. Hyper-V Live 
Migration can effectively reduce planned downtime 
for SQL Server database server hardware and soft- 
ware maintenance. For more detailed step-by-step 
information about setting up Live Migration, refer to 
“Hyper-V Live Migration: A Step-by-Step Guide,” 
www.sqlmag.com, InstantDoc ID 125262. 


Dynamic Resource Manage- 
ment for Enhanced Application 
Availability 

Availability isn’t limited to server uptime. True avail- 
ability must be measured by the end user’s ability to 
access computing resources. If the server's responsive 
time is affected, availability is also affected because 
the users can’t get the information they need to do 
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their jobs. This can happen even when the database 
server is up and fully available but demands for system 
resources overload the server, making it unresponsive. 
Unacceptable response time is definitely a form of 
downtime. 

Virtualization and Live Migration are able 
to work together to provide the foundation for 
establishing dynamic management of IT resources. 
Virtualization lets you encapsulate a workload so 
that you can move it (within a VM) between dif- 
ferent Hyper-V hosts. Live Migration is the trans- 
port mechanism that can move the VMs between 
Hyper-V hosts with no downtime. System Center 
Virtual Machine Manager (VMM) 2008 R2 and Sys- 
tem Center Operations Manager (SCOM) 2007 are 
the key components that are required to automate 
the process of moving database server VMs between 
Hyper-V servers. 

SCOM can monitor system resource levels, such 
as the CPU and memory utilization, of both Hyper-V 
hosts and the VMs running on those hosts. VMM 
2008 R2's Performance and Resource Optimization 
(PRO) feature ties SCOM’s monitoring capabilities to 
VMM's management capabilities. SCOM can detect 
when either Hyper-V hosts or VM guests exceed 
predetermined operational levels, such as when CPU 
utilization exceeds 60 percent, and then trigger a 
VMM PRO Tip. VMM's PRO Tips can automati- 
cally initiate a Live Migration, moving one or more 
VMs to other Hyper-V hosts. Automated VMM PRO 
Tips let your IT infrastructure dynamically respond 
to periods of high resource utilization by automati- 
cally moving VMs to hosts with greater capacities or 
lower workloads. 

VMM PRO Tips can be generated for a variety of 
reasons, including the lack of resources on a Hyper-V 
host or VM, hardware problems (such as temperature 
overages), power supply failure, network congestion, 
fan failure, or application performance falling below 
an acceptable threshold. A system can respond to 
a PRO Tip manually or automatically. PRO Tips 
can be informational or define actions required to 
correct a given operational condition. Remediation 
of PRO Tips can include migrating VMs to other 
hosts and starting VMs. For example, if the load on 
a web farm rises above a certain level, a PRO Tip can 
automatically start additional web server VMs to 
handle the load. 

You can see an example of dynamic resource 
management using Hyper-V, SCOM, and VMM 
in Figures 2 and 3. In Figure 3, the warning icons 
indicate where a PRO Tip has detected that the 
SQL Server workloads running in a given VM have 
exceeded the predefined operational limit. Auto- 
mated implementation of the PRO Tip let VMM live 
migrate the VM to the next node in the cluster, which 
had a lower workload—dynamically adjusting to the 
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VMM PRO and Dynamic Resource Management before a PRO Tip is detected 
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VMM PRO and Dynamic Resource Management after a PRO Tip is detected 


situation that caused the operational problem and 
correcting it. Dynamically moving the SQL Server 
database VM to a Hyper-V host that can deliver 
better performance provides end users with improved 
application responsiveness and availability by match- 
ing the VM’s workload to the Hyper-V host with the 
available resources. 


Virtually Better Availability 
Virtualization can add a lot of value to an orga- 
nization, but one of its most important aspects is 
the ability to enhance the availability of workloads, 
such as SQL Server, that run in the VMs. SQL Server 
was once considered to be an application that was 
impossible to virtualize, but today many businesses 
have successfully virtualized SQL Server. By doing 
so, these businesses have gained benefits in several 
different areas, including better utilization of their 
resources, easier manageability, and improved avail- 
ability. ፳፲ 
InstantDoc ID 125845 
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erformance is always a concern, but unless it's 
ዜ2. problem, it's rarely a priority. That's why so 

many database applications run into perfor- 
mance problems—most applications are built around 
a “get it done yesterday” schedule and performance 
concerns are secondary to release. In fact, in most 
cases, all a developer focuses on is how an application 
works for a single user on a high-end developer box. 
“It works on my machine” is the response to most 
bugs that emerge just prior to, during, or shortly after 
deployment. 

Performance is anything but a new issue, and over 
the years more and more resources and tools have 
been created to help quickly identify performance 
bottlenecks. Visual Studio (VS) 2010 includes the 
latest version of one such tool, the VS Performance 
Profiler. This feature-rich tool includes several new 
capabilities, including some that are specific to check- 
ing the performance of your ADO.NET data access. 
In this article, Pm going to provide you with an 
introduction to this tool and then walk you through 
using it to debug solutions in a couple of specific 
performance scenarios. 

To follow along with the examples discussed in 
this article, you'll need VS 2010 Professional, Pre- 
mium, or Ultimate. The analysis tab is available in 
all of these versions. To use the analysis tools, you'll 
also need an application to test. There's a simple 
application solution associated with this article with 
VB and C# versions available for you to download 
from www.sqlmag.com, InstantDoc ID 125847. The 
sidebar “Create a Sample Application for Perfor- 
mance Profiler to Test” describes loading or creat- 
ing the solution. This article is going to gloss over 
creating the sample applications and mainly look at 
testing this code's behavior. 

I developed the sample applications against 
a local SQL Server 2008 R2 database and tested 
them with SQL Server 2008 —your results may vary 
with other database versions. However, the version 
of the database isn’t a dependency for using the 
performance profiling tool. The provided examples 
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use the standard AdventureWorks sample database 
available for download from CodePlex at msftdb 
prodsamples.codeplex.com. 


Performance Profiler 

Performance Profiler has enough features and capa- 
bilities that I could write a book chapter and still 
not cover them all, so I'll start by looking at some 
of the key features and defining some of the areas 
that I won't explore in this article. With this article, 
Pm aiming to introduce database developers to 
some of the tools that are available for performance 
monitoring, so anything dealing with topics such 
as high-end High Performance Cluster testing and 
elements related to parallelism is outside the scope 
171 cover. 

Your first step is to go to VS 2010 and 
start it with administrator privileges. (Note 
that it’s possible to performance test appli- 
cations using the command line, but 1] be 
working from within VS for the purposes of 
this article.) VS should be started with admin- 
istrator privileges because to use the performance tools, 
you'll access system-level information that's protected. 
The sample application I used for this article is a Win- 
dows Forms application, but you can create your own. 
Performance Profiler can be used with both ASP NET 
and WPF applications (but not Silverlight applications) 
and can even be configured to profile JavaScript func- 
tions, but I’ll focus on ADO.NET. 

Once you’ve opened or created a new Windows 
Forms application, you'll have access to the Analyze 
menu within VS. The first menu item is Launch 
Performance Wizard, which opens the screen that 
Figure 1 shows. This initial screen is useful as a 
starting reference because it illustrates the top-level 
options for performance profiling. Before I move on 
111 describe the focus of each of these options so you 
can have some idea what each of them is for. 

If you're familiar with the VS 2008’s Performance 
Profiler, you're probably aware that it didn’t support 
virtual environments—you had to be running on the 
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CREATING A SAMPLE APPLICATION 
FOR PERFORMANCE PROFILER 


To follow along with this Performance Profiler article, you need to create 
and use a simple .NET application. For the purposes of this article you'll 
need VS 2010 Professional Edition or above. If you choose to download the 
sample code and use it, be aware that although I’ve included a couple of 
sample performance runs, you'll need to engage the Performance Profiler in 
your environment. 

Unzip the downloaded solution under your typical project location for 
VS, then double-click the solution file to open the sample application in 
VS. The solution might load and, depending on your security settings, you 
might get a warning about the code coming from an unsafe location. Once 
the solution loads, however, VS's layout should look similar to what you 
normally see, which is the challenge. If you look in the Solution Explorer, 
you should see the Solution Items folder. By default, this will probably be 
empty. Right-click this folder in the Solution Explorer, and from the context 
menu, select Add, then Existing Item to open File Explorer. You'll see 
several .psess files listed in the solution folder. Select one or more of these 
files, and in particular the perfTest.psess file. 

Once these files have been added to your solution, you'll see the Perfor- 
mance Explorer open automatically, along with copies of my sample test 
runs for this application. At that point you can either review the historical 
data or create your own sample data based on my very basic application. If 
you intend to run the application, don't forget to install the custom stored 
procedures, the scripts for which are part of the download package. 

The alternative to using the downloaded files is creating your own solu- 
tion from scratch. This sidebar isn’t large enough for me to include all of 
my custom code, but l'Il describe what | did to create a project suitable 
for this article. | started by using the File menu to select New and open 
the new project template dialog box. From this dialog box | selected a new 
Windows Forms project, which | named perfTest. At this point VS gener- 
ated the shell of my project, including the main form. To the main form | 
added a comboBox control, below that a button, and below that a grid. | 
added a label control to the right of the comboBox control. 

Next | double-clicked the button control so that VS would generate an 
event handler for the button and take me to the code for my solution. In 
the code, | made several modifications. | started by adding a private integer 
value called mgrlD, which is used to hold the internal ID value used in 
the queries. As part of the Form1_Load event, | added the three stored 
procedure names and defaulted to the uspGetManagerEmployee entry. 
Remember that both uspGetManagerEmployee109 and uspGetManagerEm- 
ployeeOneLevel are custom stored procedures that you need to manually 
add to your copy of AdventureWorks using the provided scripts. 

Then, within the generated event handler for the button, | retrieved the 
currently selected entry in the comboBox control and then branched to 
a different private method for each of the three stored procedures. Each 
private method returns a data table that’s then assigned as the data source 
for the grid to display the query results. 

The private methods for each stored procedure are similar in that 
each creates a new list of SqlParameter objects, creates the appropriate 
parameters for the stored procedure, and then calls a function to return the 
resulting data table. Of course, the details are what make this code work, 
but each eventually calls a standard function to actually access the data- 
base. The class DatabaseFunctions is more custom code that | included to 
encapsulate my database call and basic error handling for this sample. 

Whether you use my sample code or create your own, you'll need a proj- 
ect that accesses a database in order to leverage the Performance Profiler 
in VS 2010. My sample code is just a very basic implementation to let you 
quickly set up and test a few different scenarios with this tool. You're wel- 
come to enhance my code if you want to build a reusable database query 
test tool, but that wasn't its purpose. 
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native OS to use sampling. One of the important 
enhancements with VS 2010 is that you can conduct 
CPU sampling tests on applications running in a 
virtual environment. The key to sampling, however, 
is that your application runs unchanged and Perfor- 
mance Profiler simply monitors the CPU to track 
performance while your application runs. This is the 
recommended performance modeling start point 
because it most accurately tracks your overall impact 
on the system. 

The second option, Instrumentation, works by 
taking your compiled code and adding special calls 
to the profiling engine to track the time when each 
method is entered and exited. Although it may not 
be the best place to start for your profiling, it's an 
excellent tool for narrowing down a specific method 
that might be a performance bottleneck. 

The .NET Memory Allocation option lets you 
track an application's memory use over time. It also 
uses a sampling engine, so it doesn't instrument your 
code to track exactly when memory is allocated. The 
idea behind this option is to let a developer spot an 
application that's consuming ever-increasing amounts 
of memory while running. Although memory leaks 
are now limited by the .NET garbage collector, an 
application can maintain unrecognized top-level ref- 
erences to objects, and thus perform in a manner that 
looks similar to an application leaking memory. 

The final option at this top level is Concurrency, 
which is focused on thread management. As I noted, 
Tm not going to focus on concurrency issues, but this 
tool targets those issues. Most race condition—type 
errors occur when an application is under stress and 
can then be difficult to recognize—by combining 
Performance Profiler with a stress test you can get a 
picture of any thread-related issues. 

At the bottom of the screen shown in Figure 1 
is an important link. Following this link takes you 
to MSDN, which discusses more about the profiling 
methods supported by Performance Profiler. The 
web page also introduces one of the key areas that's a 
focus of this article, ADO.NET Tier interaction. 


Profiling Data Tier Interaction 
As noted in the MSDN article “Understanding Profil- 
ing Methods” at msdn.microsoft.com/library/dd264994 
.aspx, there's another profiling method called tier 
interaction. This type of profiling runs in parallel with 
either sampling or instrumentation and lets you collect 
performance information on your SQL Server data- 
base calls. When you enable this type of profiling for a 
test, Performance Profiler will capture how much time 
your application spent in the database and how often 
it went there. It also provides information related to the 
minimum, maximum, and average time spent on your 
queries and provides a detail view that lets you get these 
same basic measurements on a per-query basis. 
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To get to where you can access this type of profil- 
ing, you need to complete the wizard. It doesn't mat- 
ter which of the four main profile methods you select 
in the wizard. The next screen lets you select from 
an available project, and by default your currently 
loaded project will be selected. Select Next on this 
screen and Finish on the next screen to continue and 
complete the wizard. Once you complete the wizard 
you should see the Performance Explorer, which 
Figure 2 shows. It lets you manage the performance 
session. Figure 2 shows a performance session where 
Performance Profiler has already been run once. The 
perfTest session shown in Figure 2 is a persistent 
item that can be run repeatedly, keeping track of 
each set of results and the current settings for your 
performance profiling. 

The toolbar at the top of this window will let you 
run the wizard again, which you'll need to do if you 
want to generate another performance profile using 
the same or a different profiling method. The second 
button lets you quickly create a baseline session, and 
the third button lets you run a performance profile 
session. You'll also note that as part of the toolbar 
at the top of the window in Figure 2, there's a drop- 
down menu that lets you switch the currently selected 
session between some of the different profiling meth- 
ods. However, while you can switch from sampling to 
instrumentation, this toolbar doesn't provide access 
to the tier interaction setting. 

If you want to enable tier interaction profiling, 
you need to open the property pages for a selected 
performance session. To get to the property windows 
shown in Figure 3, right-click the performance ses- 
sion in Performance Explorer and select Properties 
from the context menu. You might see a performance 
session within the Solution Explorer display, but the 
context menu from that display won't open the cor- 
rect property pages. Once you've opened the property 
window, you need to select Tier Interaction from the 
list of pages in the left column to get the full display 
shown in Figure 3. At this point, it's simply a matter 
of enabling the Tier Interaction setting within your 
performance session. 

There's quite a bit of information and customi- 
zation available based on the session settings, but 1 
won't focus on them. I suggest looking at the different 
sampling triggers and settings related to instrumenta- 
tion and CPU and other counters that you choose to 
capture as part of your performance session. Now 
that you’ve got a performance session ready to collect 
the tier interaction data, you can run a session and 
review and interpret the results. 


Using Performance Profiler 

One of the major advantages of Performance Profiler 
is that, unlike the static analysis settings that affect 
all of your compilations, you can run and test your 
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application with Performance Profiler open, but 
without having to actually start Performance Profiler. 
When you're ready to profile, you'll want to use Start 
Profiling within the Performance Explorer to start your 
application. The application should start normally, and 
you simply execute a variety of tests to run your code. 
For the case of this article, the sample code includes a 
few stored procedure database calls that are listed in 
the drop-down list, a button to trigger the execution 
of the call, and a grid where the results are displayed. 
There's also a label to display the query chosen, so you 
can remember which query 1t was that you just ran. 
This article isn't going to focus on the creation of the 
custom code behind this project—see the sidebar for 
more information. 

For the database part of my custom develop- 
ment, I took an existing stored procedure from the 
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sample AdventureWorks database and modified 
it twice. The baseline stored procedure 1 selected 
was uspGetManagerEmployees. This stored proce- 
dure traverses the employee table recursively and 
retrieves all of the employees for a given manager 
ID. First I made a hard-coded version that was set 
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Query Calls Total Elapsed . Min Elapsed Ti... Max Elapsed T~ Avg Elapsed Ti. 


ADA TA 


Count Total Elapsed... Min Elapsed Ti... Max Elapsed T~ Avg Elapsed 1... 


li 
418 161.16 0.23 43,66 0.39 
2 6320 31.31 31.89 31.60 


to use employee 109 exclusively, letting me test my 
code without parameters. I included this version 
of the procedure in the download because the code 
still references it. Note that you need to run the 
CREATE script manually for each of my custom 
versions of this stored procedure. Also remember 
that I’ve only tested these stored procedures and 
code against SQL Server 2008 R2 and SQL Server 
2008. 

The second custom version of the stored proce- 
dure uspGetManagerEmployeeOneLevel is the focus 
of the performance profile. This stored procedure 
only retrieves a manager's direct reports. Doing this 
results in a much different query, but in order to mir- 
ror the original query’s results, the code must repeat- 
edly call it—the first-level reports for a manager must 
all be queried for their direct reports, those reports 
must be queried for their reports, and so on. I put the 
recursive calls to do so in place so that as many as 
200 or 300 database queries might execute to retrieve 
the same information. The result is that although I 
don’t sort the data, the same data is provided to the 
data grid. 

At this point, if you’ve loaded the sample code 
download, you'll see that Pve included several copies 
of the test results. For your purposes, however, the 
next step is to click the Start Profiling button within 
the Performance Explorer to execute the code. You 
can query for each of the three stored procedures as 
many times as you'd like. After retrieving the data at 
least once for each one, close the application. This will 
end the profiling session and VS will automatically 
generate and open a report. 

Figure 4 shows a portion of the default report 
view. I’ve omitted most of the report because it 
doesn’t include the Tier Interaction data. You need 
to use the Current View drop-down menu that Figure 
4 shows to select the Tier Interaction view. Doing so 
will take you to a display similar to what you can 
see in Figure 5, although your view won’t open fully 
expanded. Selecting the (local) item in the top grid 
will pull up the database interactions in the Database 
Connections details grid. 

You can see in Figure 5 that in this test case, I 
actually ran each of my two data retrieve options 
twice, resulting in 420 calls to the database. The 
lower window breaks these calls out by stored 
procedure and, as expected, it shows that the 
uspGetManagerEmployeeOneLevel stored proce- 
dure makes up 418 of the 420 calls to the database. 
The report data is fairly simple and easy for a devel- 
oper to understand. If a developer finds a database 
tool’s information to be abstract or misleading, they 
can quickly see that if they let the database do the 
work of building the employee hierarchy, the overall 
performance is better. Although the simple query 
executes faster, the current implementation ends 
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up taking longer because of how many times the 
database is queried. 


Right Tool for the Job 

VS 2010 provides database application developers 
with an excellent new way to monitor what's hap- 
pening to their applications as they connect to their 
databases. For a full-time, fully trained DBA, the 
view this tool presents might feel overly simplistic, 
but the example presented in this article was just a 
single contrived example. The key fact to take home 
is that a developer can use this tool to find out why a 
query is being executed hundreds (or thousands) of 
times for a single action, or why a query is taking 10 
or 20 times longer than a similar query. Performance 
Profiler isn't a diagnostic tool to show what's happen- 
ing in the database. You can use it to investigate and 
diagnose application behavior and to call out in more 
detail activity with the database that might otherwise 
go unnoticed by the developer. 

Although I used a Windows Forms application, 
you could just as easily run Performance Profiler for 
an ASP.NET website. Although you can’t perfor- 
mance test a Silverlight application, you can provide a 


PERFORMANCE PROFILER 


mock UI and performance test the WCF web services 
used by that application. 

I haven’t covered capabilities, such as adding 
custom performance settings, reviewing the coding 
standards guidance the tool generates, or several other 
new features. Feel free to check them out, because if 
you're using a version of VS 2010 above the Express 
version, you have this tool built in—it isn't one of the 
myriad tools that are available only with a high-end 
MSDN subscription. 

Using this tool instead of more DBA-oriented 
tools means that you might not get as many details 
about what your T-SQL scripts and stored proce- 
dures are doing, but Performance Profiler provides 
you with a window to find out whether these compo- 
nents might need further examination. Performance 
Profiler doesn’t replace a tool like SQL Server Pro- 
filer, but for someone who’s more developer than 
DBA or who’s working in a small shop and doesn’t 
frequently use SQL profiling, Performance Profiler 
provides a way for you to get a great baseline for the 
performance of your application’s database access. 

፳፲ 
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Build a Cube to Perform 


Risk Analysis 


Run simple simulations of changing conditions 


hen a client, who I'll refer to as Big Green 

Cola, asked my team to help them “run 

scenarios” using a cube, we were skeptical 
as to whether a cube was the right technology to use. 
We wanted to make sure that we would be able to 
meet their requirements, and we feared that the client 
was hoping to run extremely complex simulations that 
wouldn't be possible using cube technology. After sev- 
eral meetings, during which we discussed the client's 
needs in more detail, we agreed to attempt to design 
and develop a cube to run the scenarios. 

Big Green Cola manufactures and distributes soft 
drinks. The company contracts with independent 
distributors, who in turn distribute to retail outlets. 
Given the recent upheaval in the economy, the cor- 
porate officers at Big Green Cola became concerned 
that some of their independent distributors might not 
be able to continue distribution. 

Big Green Cola wanted to do scenario-based risk 
analyses that would simulate the potential failure of 
some of their distributors to see what effect 1t might 
have on their operations. Big Green Cola wanted 
to be able to plan for the possible reassignments of 
retail outlets to competing independent distribu- 
tors on short notice in the event that a distributor 
declared bankruptcy or otherwise went out of busi- 
ness. The cube would let them see which geographic 
areas would be affected by the failure of a particular 
distributor and which other Big Green Cola distribu- 
tors were already operating in that same geographic 
area or nearby. 

A major user requirement was that the Scenarios 
Cube show the original structure of the route and 
its distributor, even after a distributor and its route 
were cast as “unhealthy” in a given scenario. In other 
words, the change in status would be reflected only 
in the measures and calculated members, not in the 
hierarchies or pivot table layouts. This would make 
actual versus scenario distribution patterns more 
obvious, and thus instructive, for the users. Let's take 
a look at how we designed and built a cube that let 
our client use a cube to run simple simulations of 
changing conditions. (You can download the sample 
cube by going to www.sqlmag.com, InstantDoc ID 
125818, and clicking Download the Code Here.) 
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Distribution Scenarios Cube 
Star Schema 

We had already provided Big Green Cola with a general 
usage cube that included dimensions for Date, Time, 
Product, Distributor, Route, Retail Location, and 
Geography. The Distribution Scenarios cube reused 
four of those dimensions: Distributor, Retail Location, 
Route, and Geography, as shown in Figure 1. 

The original Distributor dimension contained a 
distributor ID, distributor code, and distributor name. 
The RetailLocation dimension included only Retail- 
LocationID and RetailLocationName, although there 
are plans to add groups and categories to this dimen- 
sion in the future. 

The Route dimension included information from 
each distributor regarding how that distributor 
groups and runs its delivery logistics. Big Green Cola 
was considering using additional data provided by the 
distributors for even better short-term contingency 
planning using geocoded data, but the 
company decided to table that enhancement 
for a future release. In lieu of the geocoded 
data, the Routes dimension simply provided 
a recognizable grouping of the locations 
that Distributors actually use in their plan- 
ning and delivery and that the Big Green Cola users 
found helpful when developing contingency plans. 


(kristl.tyler @ emc.com) is a business 
intelligence architect for EMC Consulting. 
SQL Server Analysis Services is her favorite 


layer in the BI stack. 


MORE on the WEB 


See the web figures and 
listings and download the 
code at InstantDoc ID 125818. 
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Figure | 


The star schema 


October 2010 37 


Kristl Smith Tyler 


SCENARIO-BASED RISK ANALYSIS 


The original Geography dimension included the 
States, Regions, and Divisions of the United States 
used by the Census Bureau. For more granularity, 
we added zip code data provided by the US Postal 
Service. The original Geography dimension, therefore, 
contained the Region, Division, State, City, and Zip 
Code levels. For the purposes of this article, however, 
we reduced the dimension so that it shows only State- 
Name, City, and ZipCode. 


Scenario Facts 

The single measure in our fact table was based on the 
Cases field and represented the average case volume. 
The average case volume was derived by rolling up 
the total number of cases of all varieties of soft drink 
products delivered to a single retail outlet during 
a seven-day calendar week. These weekly numbers 
were averaged for data going back three months. The 
measure was simply called Cases in the fact table and 
was renamed Total Cases in the cube. 

Once the fact table was loaded we created a view 
that triples the amount of data stored within it. Our 
reason for tripling the data was that it was quite 
simply the only way we could come up with to run 
the scenarios. In the case of our particular data set, 
triplication didn’t present a performance problem. In 
the event that you’re working with a very large set of 
facts, and you still want to use triplication, I recom- 
mend looking for opportunities to roll up the data to 
smaller sets prior to triplication. 

To understand the purpose of data triplication, 
begin by considering the fact that dimensions, in 
common usage, act either as bucketing mechanisms 
(for slicing) or as filters (for query results or data 
outcomes). When dimensions are placed on the 
column or row axes of pivot tables, they act as slic- 
ers. When members of a dimension are selected or 
deselected, they’re acting as filters. Our interest was 
in letting users select members as a way of selecting 
a scenario, which, in a sense, filters two out of every 
three versions of a record, but leaves the third behind 
and still viewable. 

For example, imagine a Colors dimension that 
includes Red, Orange, and Yellow members. If you 
use that dimension in the page filter of a pivot table 
and you select only Red and Orange, you Te effectively 
removing Yellow from view. But what if instead of 
removing Yellow, we wanted to change Yellow to 
Black? Or, more specifically, we wanted to mark the 
Yellow records as Removed but leave them visible and 
show their compromised state of Removed. This isn’t 


TABLE |:A Single Record After Triplication 
Litem | Quantity | Record Type | Color | Record Count 


Banana 5 
Banana 5 
Banana -5 
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Actual Record Yellow 1 
Scenario Record Black 1 
Scenario Reversing Record Yellow 1 


typically possible in a cube, which is why we tripled 
the data as our first step. 

To continue with our Colors example and demon- 
strate the purpose of triplication, begin by creating 
three records to represent each original record—an 
actual record (Actual Record), a scenario state record 
(Scenario Record), and a record to mask the scenario 
state record until the scenario is run (Scenario Revers- 
ing Record), as shown in Table 1. 

Roll up the Yellow records by using SQL to group 
on the Item, Quantity, and Record Count attributes, 
as Table 2 shows. If you then put a filter onto the data 
set and excluded all Yellow rows, you'd have fact data 
that appears not to change, except in an indicator 
field, as shown in Table 3 as Record Count. 

Once an indicator field has been developed that 
will change according to the user's selection, that indi- 
cator field can be used to drive the calculation of other 
fields, Note that in the Color dimension example, if the 
user were to select Black as the compromised Color 
value, we would have a strange and messy result. This 
would be equivalent to choosing Big Green Cola as the 
compromised distributor in our cube. We don't want 
to simulate the failure of Big Green Cola; we want to 
simulate the effect of compromised distributors on Big 
Green Cola. Later in this article, I'll describe how we 
got around this issue in a way we felt wasn't ideal but 
certainly sufficient and acceptable to our client. 

Rather than tripling the rows in our fact table, we 
created a view that would use each row in the fact table 
three times. The view can be created in the SQL Server 
data engine or it can be implemented as a Named 
Query in the cube's Data Source View. We thought it 
would be best implemented as a SQL Server Analysis 
Services (SSAS) Named Query; however, we would 
have lost our comments if we had implemented it 
there, so in the sample code we implemented the view 
in the data engine, as shown in Web Listing 1 (www 
.sqimag.com, InstantDoc ID 125818). 


Understanding the Triplicated 
Records Using SQL 

Before moving on to a cube structure, it’s best to first 
perform some SQL queries on our actual relational 
data that show the effect that filtering would have 


TABLE 2: Grouping on the 
Item, Quantity, and Record 
Count Attributes 


| tem | Quantity | Record Count 


Banana 5 3 


TABLE 3: Same Grouping 
After Filtering the Data Set 
to Exclude All Yellow Rows 


[Item | Quantity | Record Count | 
1 


Banana 5 
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on the data because this is what the cube will do to 
create the scenario. If you query the Cases by Loca- 
tion data and group it by the original records using 
RetailLocationID as the marker for an individual 
entry and sum the Cases and Scenario Cases fields, 
you'll see that your data will appear with a Scenario 
Cases total of zero in all the rows, as shown in Web 
Figure 1. This indicates that none of the entries have 
an unhealthy distributor. 

Next, remove a particular distributor from the 
original data set (or simulate its unhealthy state) 
using a Where clause, as shown in Web Figure 2. 
The resulting data set will show that some rows now 
have a value of zero for their number of Cases, and 
the value formerly in the Cases column now shows 
up in the Scenario Cases column. In Web Figure 2, 
the data indicates that RetailLocationsIDs 1 through 
5 are all serviced by Distributor 2 because the only 
distributor compromised in the WHERE clause is 
Distributor 2 and the Cases for locations 1-5 now 
have Scenario Cases instead of Cases. You can then 
use the Scenario Cases value as the indicator field. 
Any row with [ScenarioCases > 0] indicates a retail 
location that has been affected by the current sce- 
nario selected by the user. This example simulates 
how the cube will remove distributors yet retain the 
original records to display information about their 
new states. 


Building the Scenarios Cube 

As previously stated, we built the Scenarios cube 
using dimensions we already had in place for more 
traditional cubes. This made the development time 
very short. Our original Distributors dimension 
was modified only slightly to create what we called 
the Scenario Healthy Distributors dimension. We 
created it using a Named Query in the Data Source 
View, as shown in Figure 2. We used the data from 
the standard Distributor dimension table and 
unioned on a row for the non-distributor that would 
represent any route that Big Green Cola would need 
to find coverage for during a particular scenario. See 
Listing 1 for the code we use in the Named Query 
of our Data Source View, and see Figure 2 for a 
snapshot of that same code in the context of the 
Named Query. 

Our original plan was to have the user select 
unhealthy distributors one-by-one from the dimen- 
sion, but after deciding we would give users access 
to the cube via an Excel workbook, we realized we 
wouldn't be able to use a NOT EQUALS operator 
because of limitations in Excel 2007. We included 
a note to the users telling them to select all and 
then deselect the carriers they want to simulate as 
“unhealthy.” 

For this reason we called the dimension Scenario 
Healthy Distributors, as all items that were selected 
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would be represented as healthy, and all that were 
not selected would be represented as unhealthy. We let 
the users know that this dimension and the wording 
might initially cause some confusion, but that after a 
bit of practice we expected users to feel comfortable 
using the dimension. 

In my earlier example using the colors yellow 
and black, we would have used the Named Query in 
the Data Source View to rename Black to DO NOT 
SELECT. We recognized that users might also be 
confused if we allowed them to use the Healthy Dis- 
tributors dimension as a slicer instead of as a filter. 
We again approached our users about this potential 
problem. They assured us that they were willing to 
overcome the issues with whatever level of training 
was required to help them understand the proper 
use of the dimension. To help reduce confusion, 
we named the dimension FILTER ONLY - NOT 
FOR USE AS SLICER, as shown in Web Figure 3. 
All the other dimensions were built using standard 
methods. 


Name: dim_HEALTHY_Distributors 


Description: 
Big Green Cola (primary) 


Data source: 


Query definition: 


9 Edit Named Query LON መመመ! 


DELICADAS 


[> 


Sort Type Sort Order 


'(.»› DistributorCode + ገ ' + DistributorName | DistributorLabel 


DistributoriD, '(' + DistributorCode + ገ ' + DistributorName AS DistributorLabel 
dbo.dim_Distributor 


0 AS DistributorlD, 'ዐዕ NOT USE AS UNHEALTHY’ AS DistributorLabel 


| 
N 


Figure 2 


A Named Query for the Distributor dimension in the Data Source View 


LISTING |: Code Used to Build the 
Distributor Dimension 


Select 

DistributorID 

, 'ር' + DistributorCode + 'ን ' + DistributorName as DistributorLabel 
from dim_Distributor 

Union 

Select 

as DistributorID 

„ 'DO NOT USE AS UNHEALTHY' as DistributorLabel 
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TABLE 4: Renaming the Relational Field 
Names to the Cube Measure Names 


Original Field Name Aggregation 


Cases Non-Affected Cases Sum 
Scenario Cases Scenario Affected Cases Sum 
RetailLocationlD Retail Location Count Distinct Count 


LISTING 2: Code to Create Three Calculated Members 


CREATE MEMBER CURRENTCUBE. [Measures]. [Total Cases] 

AS ([Measures].[Non-Affected Cases] + [Measures] .[Scenario Affected 
Cases]), 

FORMAT_STRING = "4,9", 

FORE_COLOR = Y /*Black*/, 

VISIBLES 


CREATE MEMBER CURRENTCUBE. [Measures] .[Pct Cases Affected] 

AS IIf([Measures].[Scenario Affected Cases]/[Measures]. [Total 
Cases]>@, [Measures].[Scenario Affected Cases]/[Measures]. [Total 
Cases] ,null), 

FORMAT_STRING = "Percent", 

FORE_COLOR = 255 /*Red*/, 

VISIBLE S 1: 


CREATE MEMBER CURRENTCUBE. [Measures] . [Retail Loc Affected] 
AS Tif([Measures].[Scenario Affected Cases] > Ø, 1, null), 
FORMAT_STRING = "#,9", 

FORE_COLOR = 255 /*Red*/, 

''10511911፳ = ale 


CREATE MEMBER CURRENTCUBE. [Measures].[Pct Locs Affected] 

AS [Measures].[Retail Loc Affected]/[Measures].[Retail Locations Count], 
FORMAT_STRING = "Percent", 

FORE_COLOR = 255 /*Red*/, 

VISIBLE = 1: 


Name: 
[Distributor Health] 


2 [Parent Properties O 


Parent hierarchy: Measures E 
Parent member: 


2 Expression 


Iif ((Measures].[Scenario Affected Cases] > 0, "Unhealthy",null) 


£ Additional Properties 
Format string: “20° " 
Visible: True ~ 
Non-empty behavior: xl 
Associated measure group: ] 


Display folder: 


2 Color Expressions 


Fore color: 255/*Red*/ መ] 


Back color: E 


Figure 3 


Marking retail locations that have an unhealthy distributor 


ርፎ Herardhy Operator 
FILTER ONLY - NOT FOR USE ASSUICER ff SCENARIO HEALTHY Distributors Not Equal 
Select denension> 


Fite Expression 


@ (4500) AB Certan Diştrêxtng 
@ (LEA) Lee, Ed and Haroki Ost 


Non-Afected Cases Scenaro Affected Cases 


omas AUSTIN 2 3 o 

E IRYAN 5; é o 

ህህ Lal * 9 

Grand Total 94 Lal 9 
Figure 4 


Adding HEALTHY Distributors dimension to the Filter area of the cube 
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Adding Measures 

We created three measures by pulling in the Cases, 
Scenario Cases, and RetailLocationID fields from 
our fact table. Then we renamed these measures 
to Non-Affected Cases, Scenario Affected Cases, 
and Retail Location Count, respectively, as shown 
in Table 4. 

Note that when you change the aggregation on 
the Retail Location Count to Distinct Count, SSAS 
will warn you that it’s better to “break distinct count 
measures into separate measure groups.” Indeed, 
it would be a better-performing design to use the 
RetailerLocationID from the dim_RetailLocation 
table. However, for the purposes of this article we will 
accept this less-than-ideal design for now to focus on 
the main topic of this article. Also, note that in the 
more ideal design, the Count function can be used 
instead of Distinct Count, which will be as accurate, 
but better performing, than Distinct Count as we 
have used it. 


Designing the Cube’s 

Calculated Members 

After adding the three measures we created several 
Calculated Members using Listing 2. We start with 
a Text field that will clearly mark those retail loca- 
tions (or routes or zip codes) that have an unhealthy 
distributor, as shown in Figure 3. The [Distributor 
Health] Member will display the word “Unhealthy” 
in red for any entry in which the original entry and 
reversing entry have been removed. When those 
records are filtered out, the Non-Affected Cases 
value will be zero and the Scenario Affected Cases 
value will be greater than zero. The Big Green Cola 
Distribution Risk cube actually includes many other 
calculated members that users can use to perform 
additional analyses on the effects of the impaired 
distributor. These other calculated members are either 
proprietary or are too complex to describe here. The 
Distributor Health measure is the lynchpin of the 
design, and once you've built an indicator field, such 
as our Distributor Health field, you'll find that you're 
able to use the value in this member to drive many 
other members of your own design. 


Using the Cube to Run 
Scenarios 
In our sample cube we included only a few records. 
We provided just enough sample rows to display a 
variety of routes, locations, and distributors. With 
this small record set we're able to illustrate how the 
scenario cube works. 

We begin with the cube showing all distributors in 
a healthy state. Recent averages show that the Bryan, 
Texas, area stores get about 61 cases a week. Austin, 
Texas, stores get about 33 cases a week, as shown in 
Web Figure 4. 
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If we add the Healthy Distributors dimension 
to the Filter area of the cube, like Figure 4 shows, 
and choose the NOT EQUALS operator, we can 
then select the Sure Distribution Company for 
an Unhealthy status. In most dimensions, a 
choice like this would result in the removal of 
entries—the selected item with a NOT EQUALS 
operator would be filtered out by the user's selec- 
tion. In this cube, the user selection changes the 
status of those rows, instead of removing them. 
If we then add more of our calculated members 
and drill the Geography dimension down to 
Zip Code, as shown in Figure 5, we can see the 
effects of the unhealthy distributor on particular 
zip codes. 

The percentage of product and the percentage 
of delivery locations could be rolled up to higher 
levels for each successive group of managers, 
culminating in a single number that would repre- 
sent to a CEO the effect of a distributor's failure 
or success. Conversely, with a few reversing 
adjustments, a tactical user could drill down to 
identify exact locations and alternative distribu- 
tors within the same area, as shown in Figure 6. 
In the event that the company opted to begin 
contingency planning, the tactical user could 
then start contacting distributors based on the 
scenario they had just run. 

For example, imagine the Big Green Cola 
COO hears that the Sure Distribution Company 
is expected to announce it has filed for bankruptcy 
in the next few days. She asks her director of logis- 
tics to identify a percentage effect on operations. 
Her director of logistics can begin at the Region 
level, giving percentages there and instructing his 
state or county directors to drill all the way down 
to the zip code and develop their contingency 
plans complete with cases affected and alternative 
distributors. 

Reports that might have taken many days to 
develop and compile before this cube was devel- 
oped will now take under an hour. During our 
Big Green Cola project, we were able to identify a 
way to give the user an interface to select the dis- 
tributors they wanted to “impair” and have those 
selections affect certain items represented in the 
cube without having the standard, expected effect 
of removing the items from view. After changing 
the items, but not removing them from view, we 
provided a way to easily access and display the 
impaired status of an item (as changed by the 
user selection) and to make use of that recognized 
status change in further calculations. The example 
used in this article is a simple one, but the implica- 
tions for letting a user change fact row data on the 
fly are numerous. SOU 

InstantDoc ID 125818 
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Dimension Hierarchy Operator 
FILTER ONLY -NOT FOR US... $$ SCENARIO HEALTHY Distri... Not Equal 
<Select dimension> 


State Name y City Zip Code Distributor Health Total Cases |Scenario Af Non-Affec’ Pct Cases Affected Pct Locs Affected 
E TEXAS El AUSTIN 73301 20 0 20 
78701 _|Unhealthy 5 5 0 100.00% 100.00% 
78705 8 0 8 
Total Unhealthy 33 5 28 15.15% 25.00% 
EJBRYAN 77801 |Unhealthy 4 21 28 42.86% 25.00% 
77808 12 0 12 
Total Unhealthy 61 21 40 34.43% 20.00% 
Total Unhealthy 94 26 68 27.66% 11.11% 
Grand Total Unhealthy 94 26 65 27.66% 11,11% 
u 
Figure 5 


Effects of the unhealthy distributor on particular zip codes 


Dimension Hierarchy Operator 
FILTER ONLY - NOT FOR USE A... = SCENARIO HEALTHY Distribut... Not Equal 
<Select dimension> 
State Ni v (ዝሃ .- - ፻2ሄ ርዐዐፎ Route ID y Retail Location v Non-Affec Distributor Health Scenario Af 
El TEXAS TEJ AUSTIN TEJ 73301 | EJ LEAH - South Route [Stop for Pop #22 11 0 
Stop for Pop #3 |9 0 
El 78701 |g SURE - Route 8 Quick Mart #28 0 Unhealthy 5 
E 78705 E] OTD - Main Route Beverage Heaven 8 0 
E] BRYAN ጩ 77801 | E] ABCD -Route 1 Payton's Place 4 0 
E] LEAH - East Park Route Stop for Pop #1 17 ዐ 
E] LEAH - Oaks Route Cola Corral 7 0 
E] SURE - Route A Quick Mart #33 0 Unhealthy 21 
El 77808 | LEAH - East Park Route 12 0 


Figure 6 


Identifying exact locations and alternative distributors on a specific route 
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tothe problems of merging access to services across 
multiple domains and ensuring that smart card PKI 
access can be utilized with any web-browser, 
Sqlmag.com/go/F5Kerberos 


Follow us on Twitter at www.twitt 
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The Great Internet 


Mersenne Prime 
Search 


rime numbers fascinated people for centuries and still do. 

This basic concept of an integer greater than 1 and divis- 
ible only by 1 and itself is so intriguing that some spent a big 
part of their career studying primes. Thanks to Euclid we know 
that there’s an infinite number of prime numbers, but we still 
don’t have any formula to produce the nth prime. So every time 
a new largest known prime is discovered it’s an exciting event 
in the scientific community. Now, I’m neither a scientist nor a 
mathematician—I’m just a guy who likes writing SQL queries— 
but I’m fascinated with primes, as a hobby out of curiosity. 

So how big is the largest known prime? At the date of this 
writing, the largest known prime is the Mersenne prime (mean- 
ing it can be expressed in the form of 2%n - 1). The Mersenne 
prime is 2*43112609 - 1, discovered Aug 23, 2008. That's a truly 
extraordinarily large number. Expressed in decimal base, it has 
12,978,189 digits. To get a perspective, let’s calculate how long 
the number would be if written in one line with an average font 
size. Say we can fit about 5 digits per centimeter (about 15 digits 
per inch). This would make the length of our number about 26 
kilometers (about 16 miles)! 

You probably wonder how such a large prime number 
was discovered. The discovery of this, as well as several other 
Mersenne primes, was made by the Great Internet Mersenne 
Prime Search (GIMPS) program. It’s a cooperative effort of 
a network of computers, each having an application installed 
locally, crunching prime numbers, and communicating with a 
central server that assigns the tasks. Anyone with a computer 
can join the effort—visit www.mersenne.org to learn more. 

I should add a disclaimer—as you can imagine, crunch- 
ing potential prime numbers will fully utilize your CPUs. This 
program is configured to run by default with the lowest worker 
priority possible, meaning that if you run other tasks on the 
computer, they will get higher priority. Still, when crunching 
those potential primes your CPUs will be at full utilization. This 
means that you will probably be able to use your computer as an 
alternative to a small heater in the room, and it will be noticeable 
in your electricity bill. Also, if you’re energy green, and it’s more 
important for you to save the world than finding the next largest 
known prime, you would probably rather not run this program. 

Tm running this program on my home laptop. It’s an 
Alienware M15x with a Core 17 processor (Quad with Hyper- 
threading). So I have eight logical CPUs, crunching eight 
potentially record breaking primes. You can see a picture of my 
laptop at work (and at 100 percent utilization) at www.sqlmag 
.com/go/MersennePrime. 

At the top right window of my screen, it says that the esti- 
mated finish time to check the eight primes is October 14, 2010. 
My fingers are crossed and I hope to have good news to report 
soon. 
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Light Up Your Dashboards 
with Silverlight 


here are many advantages to making business 

intelligence dashboards with scorecards, graphs, 
and trends that sparkle and have visually compelling 
elements. It makes your BI solution more accessible 
to all audiences; it makes them more interesting to 
use, increasing adoption; it can help to highlight 
the important areas from mounds of data; and they 
look cool! 

Below is an example from a Microsoft BI cus- 
tomer called Jettainer, who uses the entire Microsoft 
SQL Server and .NET stack and utilized Silverlight as 
part of the company’s enterprise dashboard strategies 
to generate very impressive solutions. 

Derek Comingore and I are going to use the SQL 
Server BI blog to discuss ways you can leverage SQL 
Server, Silverlight, SharePoint, and a few third-party 
components to make exciting visualizations of your 
Microsoft BI solutions that can be displayed on differ- 
ent form factors, from laptops to mobile devices. (To 
read related articles on this topic, visit www.sqlmag 
.com/go/SQLServerBI.) 

Let’s start with a little background on Silverlight 
and the new Microsoft Silverlight PivotViewer con- 
trol. Silverlight is Microsoft’s visualization platform 
technology that lets developers build applications that 
include some of the “cool factor” elements that had 
previously been only available to Macromedia devel- 
opers, including vector graphics, transforms, transi- 
tions, and embedded video, to name a few. Silverlight 
PivotViewer is new 
from Microsoft and 
is a free download- 
able component that 
allows developers to 
organize and display 
mass amounts of data 
from data collections. 
(Visit www.silverlight 
.net/learn/pivotviewer/ 
to download it.) We’ll 
walk you through how 
to use Silverlight com- 
ponents, including 
PivotViewer, to make 
exciting dashboards 
on laptop and mobile 
device screens. 

Let’s establish a 
few guiding principals 
for this series first: 


Autos 


New Cars 


Figure | 


Silverlight PivotViewer 


1. We are going to assume that you are not a 
pure application or web developer and are much 
more interested in building BI solutions and BI 
dashboards. 

2. What this means to me is that you need 
the ease of use that tools like PerformancePoint 
Server and PerformancePoint Services bring. 

3. Therefore, we'll provide code snippets and 
links to sites to get sample code to reuse instead 
of requiring you to write the code yourself. 

4. These dashboards should be available in 
SharePoint to let your entire organization access 
the reports. 


To wrap it up for this intro, let’s keep it simple 
and re-usable. For an overall BI strategy and BI 
solution for your organization, you may want to 
use Silverlight and mobile dashboards sparingly. 
That is, perhaps a landing page that maximizes 
the cool factor. But click-through and drill-down 
may lead your users to a classic PPS dashboard or 
SSRS reports. Next, we'll kick it off with exposing 
your SQL Server BI data sources to Silverlight 
and PivotViewer and the different methods for 
accessing BI data, such as OData & SSRS sources 
for Silverlight and Excel as a source of Power- 
Pivot, keeping things simple and re-usable. 

You can read the subsequent articles in this 
series at www.sqlmag.com/go/SQLServerBl. 


More | Jew Cars Used Cars | Reviews 8 rides | ይርኳያ95 | Videos | Auto Shows | ኮካ REN | My Car Trane | 855 


Average MIG, + 333 lial 


SQL Server Magazine * www.sqlmag.com 


When NoSQL Works Better 
than SQL Server 


hat the heck is the NoSQL movement all 

about, and how does it affect SQL Server 
developers and DBAs? I got the chance to ask 
Kevin Kline, strategy manager for SQL Server at 
Quest Software, and Brent Ozar, a SQL Server DBA 
expert for Quest Software, about this fast growing 
trend at TechEd 2010. (Be sure to check out the 
rest of my interview with Kevin and Brent at www 
.sqlmag.com/go/db-administration.) 


Megan Keller: Brent, there seems to be a NoSQL 
movement taking place. What do our readers need 
to know about it? 


Brent Ozar: There are two common scenarios for 
why you would consider using something other than 
your typical relational database. One is data that is 
not worth very much money. For example, if you're 
tracking web usage amongst all your employees, you 
really don't care if somebody clicked on Farmville, 
MySpace, or whatever. You care, but there's not a 
million dollar value to that data, and there's a ton 
of it, so we want to be able to stash it somewhere. 
But we don't really need to query it that quickly, and 
were going to develop reports against it once and 
then walk away. 

The other use is frequently reloaded analytical 
data. So I might need a whole lot of sales data and 
a whole lot of customer data, and P'm going to pay 
some rocket scientist PhD to slice and dice that with 
his own tools, but at the end of that were going 
to throw that [data] away. We're just going to take 
whatever we learned and use that to make business 
decisions. Both of those are things that need peak 
loads or that are very cheap, and SQL Server doesn't 
really work very well in those situations. 


Kevin Kline: I would even go further and say relational 
databases. If you had Oracle or MySQL, even those 
would not be great at that. 


Ozar: People aren't really rebelling against the SQL 
language itself or any particular properties of the 
database. It's just that they needed something cheaper 
and they can do these crazy loads—these high loads— 
very quickly, and make it very cheap and easy to scale. 
So developers are building tools to make it easier for 
them to store data that way, and much like the cloud, 
they're not coming to DBAs asking questions. They're 
just going to build their own very specialized tools in 
order to get the job done. 
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Keller: So we should expect this movement to grow a 
lot in the next year? 


Kline: Absolutely. In fact, the writing is already on the 
wall. At the [PASS] keynote last year, Dave DeWitt 
spent a great deal of time talking about what a col- 
umn value stores. And NoSQL doesn't stand for “no 
SQL,” it stands for “not only SQL.” So we're not just 
excluding ourselves to only including relational data- 
bases. We're going to look at other ways to look up 
data: key value stores, what's the other one called? 


Ozar: XML columnar storage, XML property bags. 


Kline: Right, so there's several different ways that 
you can get to this data that are better at certain 
types of use cases. So another example might be like 
with Facebook—they use Cassandra, don't they?— 
which is one of the best known. So each one of them 
has different virtues, but the idea is that I need to 
get scaling up to millions of people. That’s where I 
need to be. And I don’t need ACID properties of a 
relational database—Atomic, Consistent, Isolated, 
and Durable. What if it’s eventually consistent so 
that when you post “I’ve got the flu today” it gets up 
there sometime in the next five minutes, but it doesn’t 
have to be there right now? We can cut all kinds of 
shortcuts compared to if we did that with Oracle or 
SQL Server or one of the relational databases. 

And one of things that Dave DeWitt pointed out 
is we have CPUs that are getting so much faster. Intel 
has already announced I think a 64- or 32-core CPU, 
so that is still continuing to accelerate. But our hard 
disks are not getting any faster; really, they kind of 
peaked out there. So we do have some intermediate 
technologies, in-memory data storage, we have SSD, 
that helps, but it’s still not enough to keep up with 
huge advances we're making in CPUs. So what if 
we turn that relational model on its head, we use a 
different model, and we can get to this stuff faster? 
That's again another opportunity that the hardware 
presents for us, just like what we have with VMs, like 
what we are having now with the cloud. And Micro- 
soft is already thinking about how are we going to be 
able to incorporate key value storage, XML storage, 
and so forth as elements of SQL Server in future 
releases. Now DeWitt up and down said, “This is not 
future looking.” It is absolutely future looking. He 
just meant, “Don’t call me on a date.” But it will be 
in there eventually. That’s Kevin Kline talking, that’s 
not Dave DeWitt. 
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Beta for New 
Visual Studio 
Announced 


icrosoft announced the public beta of Visual 

Studio LightSwitch on August 23. Light- 
Switch is a developer tool based on .NET. According 
to Dave Mendlen, senior director, developer tools 
and platform marketing at Microsoft, “LightSwitch 
will be the easiest way to build business applications 
for the desktop and the cloud. We think it’s as easy as 
flipping on a switch.” 

This tool, which enables non-developers to build 
applications, is yet another product from Microsoft 
in response to the burgeoning DIY movement in 
business. The tool consists of prebuilt application 
templates and tools that could be used to build appli- 
cations without code. 

Mendlen points out that the tool is also handy 
for developers who need to create common business 
applications quickly. “With LightSwitch, it’s possible 
to dramatically decrease the time it takes to build a 
custom application because it handles the routine 
code and it lets the developer focus on the business 
logic that makes the developer's application unique.” 
Developers can enhance the templates using CH or 
Visual Basic and can connect SharePoint, Microsoft 
Office (Excel, Word), or any data source (such as SQL 
Server or Oracle databases). Mendlen notes that a 
cool new feature blurs the line between development 
and run mode. The 
developer can test the 
application and mod- 
ify the user experience 
while the application is 
running. 

Business users can 
create applications 
for internal use with 
LightSwitch instead 
of “going rogue” and = ee : 
using tools that a 4 መመ a 
developer can't scale. a RA 
When the applica- 
tion becomes success- 
ful and it's time for a 
professional developer 
to scale it out, this 
task is much easier 
for the developer to 
handle because Light- 
Switch uses the .NET 
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Figure 2 
Visual Studio LightSwitch 
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Microsoft 
LightSwitch 


Framework and Visual Studio under the covers. 
Forrester analyst Jeffery Hammond observes the fol- 
lowing: “Rogue apps happen---they always have and 
they always will. The question is, when a rogue app 
succeeds, how do you formalize its development? How 
does IT take it on board? I’m sure they'd rather take 
on an app that’s already .NET based and has exist- 
ing VS assets than deal with a proprietary 4GL or a 
non-standard development environment. Once you 
recognize the inevitable, you can get on with making it 
a smoother process, and that’s what I think Microsoft 
has done here.” 

Mendlen pointed out that a unique feature of 
LightSwitch is that developers don’t need to think 
about where the application runs until it’s deployed. 
“With a few clicks, the application can run on the 
desktop, in the browser with Silverlight, or your 
application can target the cloud and be deployed 
through Windows Azure and use SQL Azure 
data.” 

The Visual Studio LightSwitch beta will be avail- 
able at the time that this article is published. To learn 
more, visit www.microsoft.com/visualstudio/en-us/ 
lightswitch. Licensing information for this slimmed- 
down Visual Studio product was not available at time 
of writing. ፳፲ 
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DATABASE ADMINISTRATION 
ActiveBatch Expands Jobs Library and Support 
Advanced System Concepts announced ActiveBatch V8.0, which features an improved application's 
job and workflow management capabilities through an expanded Jobs Library and support for Micro- 
soft System Center products. New to ActiveBatch is line of business adapters that can invoke Microsoft 
SQL Server, Microsoft .NET, Windows Component Object Model functions, and Oracle functions 
and procedures, without the need for scripting. ActiveBatch now supports the inclusion of Microsoft ments to products @ 
Biztalk Windows Communication Foundation LOB adapters within the Jobs Library. To learn more, sqlmag.com. 
visit www.advsyscon.com. —Brian Reinholz, 
editorial web architect 
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BUSINESS INTELLIGENCE 


Logi Ad Hoc Releases Ad Hoc Version 10 e 

Logi Ad Hoc has released Ad Hoc Version 10, EZ 1. 00.10: —€ == WAYE 
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ADMINISTRATION Dota Source ; 

Extend SSMS with SQL Source Control armor 

Red Gate Software has released SQL Source Con- ር 4, ዓዓ sep) [_ nan ቁ > 


trol, a tool that lets users control a database from 
within SQL Server Management Studio (SSMS). 
SQL Source Control connects users’ databases to 
either Team Found Server or Subversion. Key features include change tracking and change manage- 
ment overhead. Also, Source Control can be integrated with Red Gate's SQL Developer Bundle. To 
learn more, visit www.red-gate.com. 


BACKUP AND RECOVERY 
Idera Updates SQL safe and SQL virtual 
database 


afe Today 


Idera announced SQL safe 6.4, its SQL Server |” ሮመ AA # SOL 
backup and recovery solution. DBAs can now | "Jos ATRAEN AAA RCN DASÊNA ed 
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execute previous backup operations by choos- 
ing to rerun a single failed policy job or rerun the 
entire policy, and verify backup files directly from 
the SQL safe Management Console. Idera has also 
released SQL virtual database 1.4, a solution that 


lets backup files look and act like an actual SQL cas N : ባቸ oran) 

Server database. With updated features users can e " uc 

run queries, insert and delete data, run reports, and E xazxêzıai 
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Express Products 
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arlier this year, I was lamenting the fact that 

Microsoft has been moving away from small 
business and imbuing its products with a strong 
enterprise orientation, and I noted that these products 
have also become increasingly complex and expensive. 
Although most readers wholeheartedly agreed with 
me, a few readers pointed out that Microsoft is pro- 
viding a great service to small businesses through its 
Express line of products. I’ve written about the new 
SQL Server 2008 R2 Express lineup in InstantDoc ID 
125207. Now, let’s look at the Visual Studio (VS) 2010 
Express suite of products. 

The VS Express products offer big benefits to 
small businesses. And all of the Express products are 
free—even if you're using them to develop commer- 
cial applications. (Download the Visual Studio 2010 
Express products from www.microsoft.com/express/ 
downloads.) The only requirement is that you com- 
plete a registration within 30 days of installing the 
product. 

All VS versions, including the Express versions, 
contain the same compiler as the purchased versions 
of VS. The main difference between the Express 
versions and the different paid versions lies in the 
expanded templates that are provided in the commer- 
cial versions of VS 2010. VS 2010 Express products 
support the new .NET Framework 4, and all of the 
products take advantage of VS 2010’s new IDE with 
its multiple monitor support. 


Visual Basic 2010 Express 

Targeted at beginning developers and hobbyists, 
Visual Basic (VB) 2010 Express enables you to 
develop WinForm (Windows) applications. Despite 
being a free product, VB 2010 Express is a capable 
Windows development platform, and it lets you create 
database applications and perform data binding with 
SQL Server databases. 


Visual C# 2010 Express 

Like VB 2010 Express, Visual C# 2010 Express is 
intended to be used by hobbyists and novice develop- 
ers and enables you to develop WinForm applications. 


It can also be used to create data bound applications. 
The choice of VB or C# is largely just a matter of 
preference. Both offer full access to all of the features 
in their respective languages. 


Visual C++ 2010 Express 

Thave doubts about the real audience for this product. 
C++ has long been the domain for professional devel- 
opers, not beginners or hobbyists. Visual Studio C++ 
2010 Express is the only tool here that’s capable of 
created native unmanaged Win32 applications. How- 
ever, it lacks support for the Microsoft Foundation 
Classes (MFC) and Active Template Library (ATL) 
classes, which are the basis for many commercial 
Visual C++ applications. 


Visual Web Developer 2010 
Express 

Targeted toward beginning web developers, Visual 
Web Developer enables you to develop web applica- 
tions using either VB or C#. Unlike the other Express 
products, it can also be used to create Windows Azure 
applications. No doubt attempting to further con- 
fuse customers, Microsoft has also released an early 
version of its Web Matrix tool, which is also a free 
download targeted toward beginning web program- 
mers. Having tried both Visual Web Developer 2010 
Express and Web Matrix, I can’t see any reason to 
recommend Web Matrix over the more full-featured 
Visual Web Developer. 


Visual Studio 2010 Express for 
Windows Phone 
A new addition to the VS Express line of products, 
VS 2010 Express for Windows Phone enables you to 
develop mobile applications for Microsoft’s upcom- 
ing Windows Phone 7 platform. To get it, you install 
Windows Phone Developer Tools, which includes 
Visual Studio 2010 Express for Windows Phone Beta, 
Windows Phone Emulator, Silverlight for Windows 
Phone, XNA Game Studio 4.0, and Expression Blend 
4 for Windows Phone. [SQL] 
InstantDoc 1D 125748 
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WHAT AREN'T YOU SEEING? 


SQL SENTRY CAN IDENTIFY PROBLEMS BEFORE THEY SNEAK UP ON YOU 


Download the free trial at sqlsentry.net 


SOL Sentry products are made by DBAs, for DBAs. They provide unparalleled insight, awareness and 
control over your SOL Server environment. With features like real-time and historical performance analysis 
and visual schedule management, SOL Sentry is the ultimate monitoring solution. 
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The Cure for SQL Server Performance Problems 
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PERFORMANCE 
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YOU HEADACHES? 


CURE THEM IN 3 EASY STEPS: 


SQL doctor at a server 


to analyze data and identify problems í 
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doctor's expert recommendations 
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